SEQUENCE LISTING

<110> Yocum, R. et al.

<120> METHODS AND MICROORGANISMS FOR PRODUCTION OF
PANTO-COMPOUNDS

<130> BGI-141CP

<140>
<141>

<150> USSN 09/400,494
<151> 1999-09-21

<150> USSN 60/210,072
<151> 2000-06-07

<150> USSN 60/221,336
<151> 2000-07-28

<150> USSN 60/221,336
<151> 2000-03-24

<160> 94

<170> PatentIn Ver. 2.0

<210> 1
<211> 311
<212> PRT
<213> Haemophilus influenzae
<400> 1

Met Glu Phe Ser Thr Gln Gln Thr Pro Phe Leu Ser Phe Asn Arg Glu
1 5 10 15

Gln Trp Ala Glu Leu Arg Lys Ser Val Pro Leu Lys Leu Thr Glu Gln
20 25 30

Asp Leu Lys Pro Leu Leu Gly Phe Asn Glu Asp Leu Ser Leu Asp Glu
35 40 45

Val Ser Thr Ile Tyr Leu Pro Leu Thr Arg Leu Ile Asn Tyr Tyr Ile
50 55 60

Asp Glu Asn Leu His Arg Gln Thr Val Leu His Arg Phe Leu Gly Arg
65 70 75 80

Asn Asn Ala Lys Thr Pro Tyr Ile Ile Ser Ile Ala Gly Ser Val Ala
85 90 95

Val Gly Lys Ser Thr Ser Ala Arg Ile Leu Gln Ser Leu Leu Ser His
100 105 110

Trp Pro Thr Glu Arg Lys Val Asp Leu Ile Thr Thr Asp Gly Phe Leu
115 120 125

Tyr Pro Leu Asn Lys Leu Lys Gln Asp Asn Leu Leu Gln Lys Lys Gly
130 135 140

Phe Pro Val Ser Tyr Asp Thr Pro Lys Leu Ile Arg Phe Leu Ala Asp
145 150 155 160

Val Lys Ser Gly Lys Ser Asn Val Thr Ala Pro Ile Tyr Ser His Leu
165 170 175

Thr Tyr Asp Ile Ile Pro Asp Lys Phe Asp Val Val Asp Lys Pro Asp
180 185 190

Ile Leu Ile Leu Glu Gly Leu Asn Val Leu Gln Thr Gly Asn Asn Lys
195 200 205

Thr Asp Gln Thr Phe Val Ser Asp Phe Val Asp Phe Ser Ile Tyr Val
210 215 220

Asp Ala Glu Glu Lys Leu Leu Lys Glu Trp Tyr Ile Lys Arg Phe Leu
225 230 235 240

Lys Phe Arg Glu Ser Ala Phe Asn Asp Pro Asn Ser Tyr Phe Lys His
245 250 255

Tyr Ala Ser Leu Ser Lys Glu Glu Ala Ile Ala Thr Ala Ser Lys Ile
260 265 270

Trp Asp Glu Ile Asn Gly Leu Asn Leu Asn Gln Asn Ile Leu Pro Thr
275 280 285

Arg Glu Arg Ala Asn Leu Ile Leu Lys Lys Gly His Asn His Gln Val
290 295 300

Glu Leu Ile Lys Leu Arg Lys
305 310



<210> 2
<211> 316
<212> PRT
<213> Escherichia coli
<400> 2

Met Ser Ile Lys Glu Gln Thr Leu Met Thr Pro Tyr Leu Gln Phe Asp
1 5 10 15

Arg Asn Gln Trp Ala Ala Leu Arg Asp Ser Val Pro Met Thr Leu Ser
20 25 30

Glu Asp Glu Ile Ala Arg Leu Lys Gly Ile Asn Glu Asp Leu Ser Leu
35 40 45

Glu Glu Val Ala Glu Ile Tyr Leu Pro Leu Ser Arg Leu Leu Asn Phe
50 55 60

Tyr Ile Ser Ser Asn Leu Arg Arg Gln Ala Val Leu Glu Gln Phe Leu
65 70 75 80

Gly Thr Asn Gly Gln Arg Ile Pro Tyr Ile Ile Ser Ile Ala Gly Ser
85 90 95

Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gln Ala Leu Leu
100 105 110

Ser Arg Trp Pro Glu His Arg Arg Val Glu Leu Ile Thr Thr Asp Gly
115 120 125

Phe Leu His Pro Asn Gln Val Leu Lys Glu Arg Gly Leu Met Lys Lys
130 135 140

Lys Gly Phe Pro Glu Ser Tyr Asp Met His Arg Leu Val Lys Phe Val
145 150 155 160

Ser Asp Leu Lys Ser Gly Val Pro Asn Val Thr Ala Pro Val Tyr Ser
165 170 175

His Leu Ile Tyr Asp Val Ile Pro Asp Gly Asp Lys Thr Val Val Gln
180 185 190

Pro Asp Ile Leu Ile Leu Glu Gly Leu Asn Val Leu Gln Ser Gly Met
195 200 205

Asp Tyr Pro His Asp Pro His His Val Phe Val Ser Asp Phe Val Asp
210 215 220

Phe Ser Ile Tyr Val Asp Ala Pro Glu Asp Leu Leu Gln Thr Trp Tyr
225 230 235 240

Ile Asn Arg Phe Leu Lys Phe Arg Glu Gly Ala Phe Thr Asp Pro Asp
245 250 255

Ser Tyr Phe His Asn Tyr Ala Lys Leu Thr Lys Glu Glu Ala Ile Lys
260 265 270

Thr Ala Met Thr Leu Trp Lys Glu Ile Asn Trp Leu Asn Leu Lys Gln
275 280 285

Asn Ile Leu Pro Thr Arg Glu Arg Ala Ser Leu Ile Leu Thr Lys Ser
290 295 300

Ala Asn His Ala Val Glu Glu Val Arg Leu Arg Lys
305 310 315



<210> 3
<211> 319
<212> PRT
<213> Bacillus subtilis
<400> 3

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gln His Asn
1 5 10 15

Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser Ile Ala Val Ser
20 25 30

Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val
35 40 45

Glu Glu Val Glu Thr Ile Tyr Ile Pro Leu Val Arg Leu Leu His Leu
50 55 60

His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu
65 70 75 80

Lys His Pro His Ser Ala Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly
85 90 95

Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu
100 105 110

Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu Ile Thr Thr Asp
115 120 125

Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser
130 135 140

Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe
145 150 155 160

Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr
165 170 175

Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu
180 185 190

Gln Ala Asp Ile Val Ile Ile Glu Gly Ile Asn Val Leu Gln Ser Pro
195 200 205

Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg Ile Phe Val Ser Asp Phe
210 215 220

Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu Glu Ser Arg Ile Phe Thr
225 230 235 240

Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn
245 250 255

Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala
260 265 270

Asp Glu Met Ala Ala Ser Ile Trp Glu Ser Val Asn Arg Pro Asn Leu
275 280 285

Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg
290 295 300

Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val
305 310 315



<210> 4
<211> 312
<212> PRT
<213> Mycobacterium leprae
<400> 4

Met Pro Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Lys
1 5 10 15

Gln Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu
20 25 30

Glu Leu Ile Gly Leu Arg Gly Leu Gly Glu Gln Ile Asp Leu Leu Glu
35 40 45

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu Ile His Leu Gln Val
50 55 60

Ala Ala Arg Gln Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu
65 70 75 80

Pro Gln Gln Asn Pro Gly Arg Pro Val Pro Phe Ile Ile Gly Val Ala
85 90 95

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gln Ala
100 105 110

Leu Leu Ala Arg Trp Asp His His Thr Arg Val Asp Leu Val Thr Thr
115 120 125

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gly Arg Arg Asn Leu Met
130 135 140

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg
145 150 155 160

Phe Val Thr Ser Val Lys Ser Gly Ala Asp Tyr Ala Cys Ala Pro Val
165 170 175

Tyr Ser His Leu Arg Tyr Asp Thr Ile Pro Gly Ala Lys His Val Val
180 185 190

Arg His Pro Asp Ile Leu Ile Leu Glu Gly Leu Asn Val Leu Gln Thr
195 200 205

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val
210 215 220

Asp Ala Arg Ile Gln Asp Ile Glu Gln Trp Tyr Val Ser Arg Phe Leu
225 230 235 240

Ala Met Arg Gly Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His
245 250 255

Tyr Ser Ala Leu Thr Asp Ser Lys Ala Ile Ile Ala Ala Arg Glu Ile
260 265 270

Trp Arg Ser Ile Asn Arg Pro Asn Leu Val Glu Asn Ile Leu Pro Thr
275 280 285

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser Ile
290 295 300

Asn Arg Leu Arg Leu Arg Lys Leu
305 310



<210> 5
<211> 312
<212> PRT
<213> Mycobacterium tuberculosis
<400> 5

Met Ser Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Arg
1 5 10 15

Gln Trp Arg Ala Leu Arg Met Ser Thr Pro Leu Ala Leu Thr Glu Glu
20 25 30

Glu Leu Val Gly Leu Arg Gly Leu Gly Glu Gln Ile Asp Leu Leu Glu
35 40 45

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu Ile His Leu Gln Val
50 55 60

Ala Ala Arg Gln Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu
65 70 75 80

Pro Gln Gln Asn Pro Asp Arg Pro Val Pro Phe Ile Ile Gly Val Ala
85 90 95

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gln Ala
100 105 110

Leu Leu Ala Arg Trp Asp His His Pro Arg Val Asp Leu Val Thr Thr
115 120 125

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gln Arg Arg Asn Leu Met
130 135 140

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg
145 150 155 160

Phe Val Thr Ser Val Lys Ser Gly Ser Asp Tyr Ala Cys Ala Pro Val
165 170 175

Tyr Ser His Leu His Tyr Asp Ile Ile Pro Gly Ala Glu Gln Val Val
180 185 190

Arg His Pro Asp Ile Leu Ile Leu Glu Gly Leu Asn Val Leu Gln Thr
195 200 205

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val
210 215 220

Asp Ala Arg Ile Glu Asp Ile Glu Gln Trp Tyr Val Ser Arg Phe Leu
225 230 235 240

Ala Met Arg Thr Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His
245 250 255

Tyr Ala Ala Phe Ser Asp Ser Gln Ala Val Val Ala Ala Arg Glu Ile
260 265 270

Trp Arg Thr Ile Asn Arg Pro Asn Leu Val Glu Asn Ile Leu Pro Thr
275 280 285

Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser Ile
290 295 300

Asn Arg Leu Arg Leu Arg Lys Leu
305 310



<210> 6
<211> 329
<212> PRT
<213> Streptomyces coelicolor
<400> 6

Met Ile Ser Pro Val Pro Ser Ile Pro Arg Ser Ala His Arg Gln Arg
1 5 10 15

Pro Glu Ala Thr Pro Tyr Val Asp Leu Thr Arg Pro Glu Trp Ser Ala
20 25 30

Leu Arg Asp Lys Thr Pro Leu Pro Leu Thr Ala Glu Glu Val Glu Lys
35 40 45

Leu Arg Gly Leu Gly Asp Val Ile Asp Leu Asp Glu Val Arg Asp Ile
50 55 60

Tyr Leu Pro Leu Ser Arg Leu Leu Asn Leu Tyr Val Gly Ala Thr Asp
65 70 75 80

Gly Leu Arg Gly Ala Leu Asn Thr Phe Leu Gly Glu Gln Gly Ser Gln
85 90 95

Ser Gly Thr Pro Phe Val Ile Gly Val Ala Gly Ser Val Ala Val Gly
100 105 110

Lys Ser Thr Val Ala Arg Leu Leu Gln Ala Leu Leu Ser Arg Trp Pro
115 120 125

Glu His Pro Arg Val Glu Leu Val Thr Thr Asp Gly Phe Leu Leu Pro
130 135 140

Thr Arg Glu Leu Glu Ala Arg Gly Leu Met Ser Arg Lys Gly Phe Pro
145 150 155 160

Glu Ser Tyr Asp Arg Arg Ala Leu Thr Arg Phe Val Ala Asp Ile Lys
165 170 175

Ala Gly Lys Ala Glu Val Thr Ala Pro Val Tyr Ser His Leu Ile Tyr
180 185 190

Asp Ile Val Pro Asp Gln Arg Leu Val Val Arg Arg Pro Asp Ile Leu
195 200 205

Ile Val Glu Gly Leu Asn Val Leu Gln Pro Ala Leu Pro Gly Lys Asp
210 215 220

Gly Arg Thr Arg Val Gly Leu Ala Asp Tyr Phe Asp Phe Ser Val Tyr
225 230 235 240

Val Asp Ala Arg Thr Glu Asp Ile Glu Arg Trp Tyr Leu Asn Arg Phe
245 250 255

Arg Lys Leu Arg Ala Thr Ala Phe Gln Asn Pro Ser Ser Tyr Phe Arg
260 265 270

Lys Tyr Thr Gln Val Ser Glu Glu Glu Ala Leu Asp Tyr Ala Arg Thr
275 280 285

Thr Trp Arg Thr Ile Asn Lys Pro Asn Leu Val Glu Asn Val Ala Pro
290 295 300

Thr Arg Gly Arg Ala Thr Leu Val Leu Arg Lys Gly Pro Asp His Lys
305 310 315 320

Val Gln Arg Leu Ser Leu Arg Lys Leu
325



<210> 7
<211> 265
<212> PRT
<213> Streptomyces coelicolor
<400> 7

Met Leu Leu Thr Ile Asp Val Gly Asn Thr His Thr Val Leu Gly Leu
1 5 10 15

Phe Asp Gly Glu Asp Ile Val Glu His Trp Arg Ile Ser Thr Asp Ser
20 25 30

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gln Gly Leu Met Gly
35 40 45

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly Ile Asp Gly Ile
50 55 60

Ala Ile Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val
65 70 75 80

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu Pro Gly
85 90 95

Val Lys Thr Gly Val Pro Ile Leu Thr Asp His Pro Lys Glu Val Gly
100 105 110

Ala Asp Arg Ile Ile Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly
115 120 125

Pro Ala Ile Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val
130 135 140

Ser Ala Arg Gly Glu Tyr Ile Gly Gly Val Ile Ala Pro Gly Ile Glu
145 150 155 160

Ile Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gln Leu Arg Lys Ile
165 170 175

Glu Val Ala Arg Pro Arg Ser Val Ile Gly Lys Asn Thr Val Glu Ala
180 185 190

Met Gln Ser Gly Ile Val Tyr Gly Phe Ala Gly Gln Val Asp Gly Val
195 200 205

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr
210 215 220

Val Ile Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser
225 230 235 240

Val Ile Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu
245 250 255

Val Tyr Glu Arg Asn Val Ser Arg Met
260 265



<210> 8

<211> 272
<212> PRT
<213> Mycobacterium tuberculosis
<400> 8

Met Leu Leu Ala Ile Asp Val Arg Asn Thr His Thr Val Val Gly Leu
1 5 10 15

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gln Gln Trp Arg Ile
20 25 30

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr Ile Asp
35 40 45

Gly Leu Ile Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu
50 55 60

Ser Thr Val Pro Ser Val Leu His Glu Val Arg Ile Met Leu Asp Gln
65 70 75 80

Tyr Trp Pro Ser Val Pro His Val Leu Ile Glu Pro Gly Val Arg Thr
85 90 95

Gly Ile Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg
100 105 110

Ile Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala Ile
115 120 125

Val Val Asp Phe Gly Ser Ser Ile Cys Val Asp Val Val Ser Ala Lys
130 135 140

Gly Glu Phe Leu Gly Gly Ala Ile Ala Pro Gly Val Gln Val Ser Ser
145 150 155 160

Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala
165 170 175

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val Glu Cys Met Gln Ala
180 185 190

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val Gly Arg
195 200 205

Ile Arg Glu Asp Val Ser Gly Phe Ser Val Asp His Asp Val Ala Ile
210 215 220

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu His Thr
225 230 235 240

Val Asp His Tyr Asp Gln His Leu Thr Leu Gln Gly Leu Arg Leu Val
245 250 255

Phe Glu Arg Asn Leu Glu Val Gln Arg Gly Arg Leu Lys Thr Ala Arg
260 265 270



<210> 9
<211> 258
<212> PRT
<213> Bacillus subtilis
<400> 9

Leu Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

Trp Gln Ala Lys Gln Asp Leu Lys Val Ile Ala Thr Gly Gly Leu Ala
210 215 220

Pro Leu Ile Ala Asn Glu Ser Asp Cys Ile Asp Ile Val Asp Pro Phe
225 230 235 240

Leu Thr Leu Lys Gly Leu Glu Leu Ile Tyr Glu Arg Asn Arg Val Gly
245 250 255

Ser Val



<210> 10
<211> 262
<212> PRT
<213> Deinococcus radiopugnans
<400> 10

Met Pro Ala Phe Pro Leu Leu Ala Val Asp Ile Gly Asn Thr Thr Thr
1 5 10 15

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg
20 25 30

Ile Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gln Leu
35 40 45

His Gly Leu Phe Thr Leu Ala Gly Ala Pro Ile Pro Arg Ala Ala Val
50 55 60

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu
65 70 75 80

Lys Arg His Phe Met Ile Asp Ala Phe Ala Val Ser Ala Glu Asn Leu
85 90 95

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp
100 105 110

Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp
115 120 125

Tyr Ala Val Val Val Asp Phe Gly Thr Ser Thr Asn Phe Asp Val Val
130 135 140

Gly Arg Gly Arg Arg Phe Leu Gly Gly Ile Leu Ala Thr Gly Ala Gln
145 150 155 160

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg Ile
165 170 175

Thr Leu Gln Ala Pro Glu Thr Ala Ile Gly Lys Asn Thr Val His Ala
180 185 190

Leu Gln Ser Gly Leu Val Phe Gly Tyr Ala Glu Met Val Asp Gly Leu
195 200 205

Leu Arg Arg Ile Arg Ala Glu Leu Pro Gly Glu Ala Val Ala Val Ala
210 215 220

Thr Gly Gly Phe Ser Arg Thr Val Gln Gly Ile Cys Gln Glu Ile Asp
225 230 235 240

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala
245 250 255

Ser Arg Ser Glu Val Arg
260



<210> 11
<213> Desulfovibrio vulgaris
<400> 11

Met Thr Gln His Phe Leu Leu Phe Asp Ile Gly Asn Thr Asn Val Lys
1 5 10 15

Ile Gly Ile Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro
20 25 30

Thr Asp Pro Gly Gln Thr Thr Asp Ser Ile Gly Leu Arg Leu Leu Glu
35 40 45

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala Cys Val
50 55 60

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu Ile Arg Arg Ala Cys
65 70 75 80

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp Ile Ala
85 90 95

Ile Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp
100 105 110

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro Gly Pro Arg
115 120 125

Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val
130 135 140

Glu Gly Gly Ala Tyr Leu Gly Gly Leu Ile Cys Pro Gly Val Leu Ser
145 150 155 160

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Arg Ile Ser
165 170 175

Leu Glu Val Glu Glu Asp Ser Pro Val Ile Gly Arg Ser Thr Thr Thr
180 185 190

Ser Leu Asn His Gly Phe Ile Phe Gly Phe Ala Ala Met Thr Glu Gly
195 200 205

Val Leu Ala Ala
210



<210> 12
<211> 246
<212> PRT
<213> Thermotoga maritima
<400> 12

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser Ile
1 5 10 15

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val
20 25 30

Phe Gln Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly
35 40 45

Asp Ala Met Arg Glu Ile Lys Gly Ile Gly Val Ala Ser Val Val Pro
50 55 60

Thr Gln Asn Thr Val Ile Glu Arg Phe Ser Gln Lys Tyr Phe His Ile
65 70 75 80

Ser Pro Ile Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val
85 90 95

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala
100 105 110

Phe Val Lys Glu Tyr Gly Lys Asn Gly Ile Ile Ile Asp Met Gly Thr
115 120 125

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala
130 135 140

Ile Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr
145 150 155 160

Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly
165 170 175

Lys Asp Thr Glu Glu Asn Ile Arg Leu Gly Val Val Asn Gly Ser Val
180 185 190

Tyr Ala Leu Glu Gly Ile Ile Gly Arg Ile Lys Glu Val Tyr Gly Asp
195 200 205

Leu Pro Val Val Leu Thr Gly Gly Gln Ser Lys Ile Val Lys Asp Met
210 215 220

Ile Lys His Glu Ile Phe Asp Glu Asp Leu Thr Ile Lys Gly Val Tyr
225 230 235 240

His Phe Cys Phe Gly Asp
245



<210> 13
<211> 273
<212> PRT
<213> Treponema pallidum
<400> 13

Met Leu Leu Ile Asp Val Gly Asn Ser His Val Val Phe Gly Ile Gln
1 5 10 15

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg Leu Ala
20 25 30

Pro Asp Ala Arg Lys Thr Gln Asp Glu Tyr Ser Leu Leu Ile His Ala
35 40 45

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala Ser Leu Arg Asp Ala Phe
50 55 60

Ile Ser Ser Val Val Pro Val Leu Thr Lys Thr Ile Ala Asp Ala Val
65 70 75 80

Ala Gln Ile Ser Gly Val Gln Pro Val Val Phe Gly Pro Trp Ala Tyr
85 90 95

Glu His Leu Pro Val Arg Ile Pro Glu Pro Val Arg Ala Glu Ile Gly
100 105 110

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser
115 120 125

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val
130 135 140

Asp Gly Thr Gly Leu Ile Gln Gly Val Ala Ile Ala Pro Gly Leu Arg
145 150 155 160

Thr Ala Val Gln Ser Leu His Thr Gly Thr Ala Gln Leu Pro Leu Val
165 170 175

Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala
180 185 190

Val Gln Ala Gly Val Val Arg Gly Thr Leu Phe Val Ile Arg Ala Met
195 200 205

Ile Ala Gln Cys Gln Lys Glu Leu Gly Cys Arg Cys Ala Ala Val Ile
210 215 220

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu Val Asp Phe Pro Pro
225 230 235 240

Ile Asp Ala Gln Leu Thr Leu Ser Gly Leu Ala His Ile Ala Arg Leu
245 250 255

Val Pro Thr Ser Leu Leu Pro Pro Ala Thr Val Ser Gly Ser Ser Gly
260 265 270

Asn



<210> 14
<211> 262
<212> PRT
<213> Borrelia burgdorferi
<400> 14

Met Asn Lys Pro Leu Leu Ser Glu Leu Ile Ile Asp Ile Gly Asn Thr
1 5 10 15

Ser Ile Ala Phe Ala Leu Phe Lys Asp Asn Gln Val Asn Leu Phe Ile
20 25 30

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe
35 40 45

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe Ile Ser Ser
50 55 60

Val Val Pro Ile Leu Asn Glu Thr Phe Lys Asn Val Ile Phe Ser Phe
65 70 75 80

Phe Lys Ile Lys Pro Leu Phe Ile Gly Phe Asp Leu Asn Tyr Asp Leu
85 90 95

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val
100 105 110

Phe Ala Asn Leu Val Ala Ala Ile Glu Asn Tyr Ser Phe Glu Asn Val
115 120 125

Leu Val Val Asp Leu Gly Thr Ala Cys Thr Ile Phe Ala Val Ser Arg
130 135 140

Gln Asp Gly Ile Leu Gly Gly Ile Ile Asn Ser Gly Pro Leu Ile Asn
145 150 155 160

Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu Ile Lys Lys Phe Pro Ile
165 170 175

Ser Thr Pro Asn Asn Leu Leu Glu Arg Thr Thr Ser Gly Ser Val Asn
180 185 190

Ser Gly Leu Phe Tyr Gln Tyr Lys Tyr Leu Ile Glu Gly Val Tyr Arg
195 200 205

Asp Ile Lys Gln Met Tyr Lys Lys Lys Phe Asn Leu Ile Ile Thr Gly
210 215 220

Gly Asn Ala Asp Leu Ile Leu Ser Leu Ile Glu Ile Glu Phe Ile Phe
225 230 235 240

Asn Ile His Leu Thr Val Glu Gly Val Arg Ile Leu Gly Asn Ser Ile
245 250 255

Asp Phe Lys Phe Val Asn
260



<210> 15
<211> 229
<212> PRT
<213> Aquifex aeolicus
<400> 15

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp Ile Ala
1 5 10 15

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His Glu
20 25 30

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu Gly Ile Ser
35 40 45

Val Lys Gln Ser Phe Ser Glu Lys Val Arg Gly Lys Ile Pro Lys Ile
50 55 60

Lys Phe Leu Lys Lys Glu Asn Phe Pro Ile Gln Val Asp Tyr Lys Thr
65 70 75 80

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys
85 90 95

Lys Phe Tyr Gly Lys Asn Val Val Val Ile Ser Ala Gly Thr Ala Leu
100 105 110

Val Ile Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe Ile Thr
115 120 125

Leu Gly Leu Gly Lys Lys Leu Lys Ile Leu Ser Asp Leu Ala Glu Gly
130 135 140

Ile Pro Glu Phe Phe Pro Glu Glu Val Glu Ile Phe Leu Gly Arg Ser
145 150 155 160

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe
165 170 175

Ile Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe
180 185 190

Lys Val Val Ile Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly
195 200 205

Ile Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Asn Leu Leu Tyr
210 215 220

Leu Tyr His Arg Ile
225



<210> 16
<211> 257
<212> PRT
<213> Synechocystis sp.
<400> 16

Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys
1 5 10 15

Gln Lys Pro Trp Leu Gly Leu Met Ile Gly Asn Ser Arg Leu His Trp
20 25 30

Ala Tyr Cys Ser Gly Asn Ala Pro Leu Gln Thr Trp Val Thr Asp Tyr
35 40 45

Asn Pro Lys Ser Ala Gln Leu Pro Val Leu Leu Gly Lys Val Pro Leu
50 55 60

Met Leu Ala Ser Val Val Pro Glu Gln Thr Glu Val Trp Arg Val Tyr
65 70 75 80

Gln Pro Lys Ile Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr
85 90 95

Pro Ser Phe Gly Ile Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu
100 105 110

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr
115 120 125

Ile Thr Gly Phe Asp Gln Asp Lys Lys Leu Val Gly Gly Ala Ile Leu
130 135 140

Pro Gly Leu Gly Leu Gln Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala
145 150 155 160

Leu Pro Lys Leu Glu Met Asp Gln Leu Thr Glu Leu Pro Asp Arg Trp
165 170 175

Ala Leu Asp Thr Pro Ser Ala Ile Phe Ser Gly Val Val Tyr Gly Val
180 185 190

Leu Gly Ala Leu Gln Ser Tyr Leu Gln Asp Trp Gln Lys Leu Phe Pro
195 200 205

Gly Ala Ala Met Val Ile Thr Gly Gly Asp Gly Lys Ile Leu His Gly
210 215 220

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn
225 230 235 240

Leu Ile Phe Leu Gly Met Ala Ala Ile His His Gly Asp Arg Pro Ile
245 250 255

Cys



<210> 17
<211> 223
<212> PRT
<213> Helicobacter pylori
<400> 17

Met Pro Ala Arg Gln Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys
1 5 10 15

Asp Ile Gly Asn Thr Arg Ile His Phe Ala Gln Asn Tyr Gln Leu Phe
20 25 30

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly Ile Gln Lys Glu Ile
35 40 45

Phe Tyr Ile Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys
50 55 60

Tyr Pro Asn Ala Lys Asn Ile Ala Gly Phe Phe His Leu Glu Thr Asp
65 70 75 80

Tyr Val Gly Leu Gly Ile Asp Arg Gln Met Ala Cys Leu Ala Val Asn
85 90 95

Asn Gly Val Val Val Asp Ala Gly Ser Ala Ile Thr Ile Asp Leu Ile
100 105 110

Lys Glu Gly Lys His Leu Gly Gly Cys Ile Leu Pro Gly Leu Ala Gln
115 120 125

Tyr Ile His Ala Tyr Lys Lys Ser Ala Lys Ile Leu Glu Gln Pro Phe
130 135 140

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala
145 150 155 160

Val Asn Tyr Gly Met Val Leu Ser Val Ile Ala Cys Ile Gln His Leu
165 170 175

Ala Lys Asn Gln Lys Ile Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu
180 185 190

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp
195 200 205

Gly Met Glu Ile Ala Leu Lys Lys Ala Gly Ile Leu Glu Cys Lys
210 215 220



<210> 18
<211> 267
<212> PRT
<213> Bordetella pertussis
<400> 18

Met Ile Ile Leu Ile Asp Ser Gly Asn Ser Arg Leu Lys Val Gly Trp
1 5 10 15

Phe Asp Pro Asp Ala Pro Gln Ala Ala Arg Glu Pro Ala Pro Val Ala
20 25 30

Phe Asp Asn Leu Asp Leu Asp Ala Leu Gly Arg Trp Leu Ala Thr Leu
35 40 45

Pro Arg Arg Pro Gln Arg Ala Leu Gly Val Asn Val Ala Gly Leu Ala
50 55 60

Arg Gly Glu Ala Ile Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp Ile
65 70 75 80

Arg Trp Leu Arg Ala Gln Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr
85 90 95

Arg Asn Pro Asp Gln Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly
100 105 110

Val Leu Ala Arg Gln Pro Ser Val His Pro Pro Leu Leu Val Ala Ser
115 120 125

Phe Gly Thr Ala Thr Thr Leu Asp Thr Ile Gly Pro Asp Asn Val Phe
130 135 140

Pro Gly Gly Leu Ile Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu
145 150 155 160

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp
165 170 175

Tyr Pro Ile Asp Thr His Gln Ala Ile Ala Ser Gly Ile Ala Ala Ala
180 185 190

Gln Ala Gly Ala Ile Val Arg Gln Trp Leu Ala Gly Arg Gln Arg Tyr
195 200 205

Gly Gln Ala Pro Glu Ile Tyr Val Ala Gly Gly Gly Trp Pro Glu Val
210 215 220

Arg Gln Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly
225 230 235 240

Ala Thr Pro Gln Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu
245 250 255

Ala Ala Leu Ala Ala Gln Gly Ala Pro Thr Ala
260 265



<210> 19
<211> 777
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(774)

<400> 19

ttg tta ctg gtt atc gat gtg ggg aac acc aat act gta ctt ggt gta 48
Leu Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca agc agg 96
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tcc tta ttt gat 144
His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

cac tcc ggg ctt atg ttt gaa cag ata gat ggc att att att tcg tca 192
His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

gta gtg ccg cca atc atg ttt gcg tta gaa aga atg tgc aca aaa tac 240
Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

ttt cat atc gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288
Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga atc gta 336
Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

aat gct gtc gct gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384
Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 432
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

tac atg ggc ggg gcg att gcc cct ggg att aca att tcg aca gag gcg 480
Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

ctt tac tcg cgt gca gca aag ctt cct cgt atc gaa atc acc cgg ccc 528
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

gac aat att atc gga aaa aac act gtt agc gcg atg caa tct gga att 576
Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

tta ttt ggc tat gtc ggc caa gtg gaa gga atc gtt aag cga atg aaa 624
Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

tgg cag gca aaa cag gac ctc aag gtc att gcg aca gga ggc ctg gcg 672
Trp Gln Ala Lys Gln Asp Leu Lys Val Ile Ala Thr Gly Gly Leu Ala
210 215 220

ccg ctc att gcg aac gaa tca gat tgt ata gac atc gtt gat cca ttc 720
Pro Leu Ile Ala Asn Glu Ser Asp Cys Ile Asp Ile Val Asp Pro Phe
225 230 235 240

tta acc cta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 768
Leu Thr Leu Lys Gly Leu Glu Leu Ile Tyr Glu Arg Asn Arg Val Gly
245 250 255

agt gta tag 777
Ser Val



<H0> 2 0 

<2 1.1> 9 60 

<::i2-> DNA 

<.l 1.3 * Bacillus subtilis 

<2 2H GDS 

<2 22> (1) . . (957) 

<400> 20 

gtg aaa aat aaa gaa ctt aac eta cat act tta tat aca cag cac aat 48 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 

15 10 15 

egg gag tct tgg tct ggt ttt ggg ggg cat ttg teg att get gta tct 96 
Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser He Ala Val Ser 

20 25 30 

gaa gaa gag gca aaa get gtg gaa gga ttg aat gat tat eta tct gtt 144 
Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

gaa gaa gtg gag acg ate tat att ccg ctt gtt cge ttg ctt cat tta 192 
Glu Glu Val Glu Thr He Tyr He Pro Leu Val Arg Leu Leu His Leu 

5 0 5 5 6 3 



cat gtc aag tct gcg get gaa cgc aat aag cat gtc aat gtt ttt ttg 240 

Ser Ala Ala Glu Arq Asn Lys His Val Asn Val Phe Leu 

7 0 7 5 8 0 



aag cac cca cat tca gcc aaa att ccg ttt att atc ggc att gcc ggc 288
Lys His Pro His Ser Ala Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly
85 90 95

agt gtc gca gtc gga aaa agc acg acg gcg cgg atc ttg cag aag ctg 336
Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu
100 105 110

ctt tcg cgt ttg cct gac cgt cca aaa gtg agc ctt atc acg aca gat 384
Leu Ser Arg Leu Pro Asp Arg Pro Lys Val Ser Leu Ile Thr Thr Asp
115 120 125

ggt ttt tta ttt cct act gcc gag ctg aaa aag aaa aat atg atg tca 432
Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser
130 135 140

aga aaa gga ttt cct gaa agc tat gat gta aag gcg ctg ctc gaa ttt 480
Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe
145 150 155 160

ttg aat gac tta aaa tca gga aag gac agc gta aag gcc ccg gtg tat 528
Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr
165 170 175

tcc cat cta acc tat gac cgc gag gaa ggt gtg ttc gag gtt gta gaa 576
Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu
180 185 190

cag gcg gat att gtg att att gaa ggc att aat gtt ctt cag tcg ccc 624
Gln Ala Asp Ile Val Ile Ile Glu Gly Ile Asn Val Leu Gln Ser Pro
195 200 205

acc ttg gag gat gac cgg gaa aac ccg cgt att ttt gtt tcc gat ttc 672
Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg Ile Phe Val Ser Asp Phe
210 215 220

ttt gat ttt tcg att tat gtg gat gcg gag gaa agc cgg att ttc act 720
Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu Glu Ser Arg Ile Phe Thr
225 230 235 240

tgg tat tta gag cgt ttt cgc ctg ctt cgg gaa aca gct ttt caa aat 768
Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn
245 250 255

cct gat tca tat ttt cat aaa ttt aaa gac ttg tcc gat cag gag gct 816
Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala
260 265 270

gac gag atg gca gcc tcg att tgg gag agt gtc aac cgg ccg aat tta 864
Asp Glu Met Ala Ala Ser Ile Trp Glu Ser Val Asn Arg Pro Asn Leu
275 280 285

tat gaa aat att ttg cca act aaa ttc agg tca gat ctc att ttg cgt 912
Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg
290 295 300

aag gga gac ggg cat aag gtc gag gaa gtg ttg gta agg agg gta tga 960
Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val
305 310 315



<210> 21
<211> 882
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(879)

<400> 21

ttg tcg att gct gta tct gaa gaa gag gca aaa gcc gtg gaa gga ttg 48
Met Ser Ile Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu
1 5 10 15

aat gat tat cta tct gtt gaa gaa gtg gag acg atc tat att ccg ctt 96
Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile Tyr Ile Pro Leu
20 25 30

gtt cgc ttg ctt cat tta cat gtc aag tct gcg gct gaa cgc aat aag 144
Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys
35 40 45

cat gtc aat gtt ttt ttg aag cac cca cat tca gcc aaa att ccg ttt 192
His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys Ile Pro Phe
50 55 60

att atc ggc att gcc ggc agt gtc gca gtc gga aaa agc acg acg gcg 240
Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala
65 70 75 80

cgg atc ttg cag aag ctg ctt tcg cgt ttg cct gac cgt cca aaa gtg 288
Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val
85 90 95

agc ctt atc acg aca gat ggt ttt tta ttt cct act gcc gag ctg aaa 336
Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys
100 105 110

aag aaa aat atg atg tca aga aaa gga ttt cct gaa agc tat gat gta 384
Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val
115 120 125

aag gcg ctg ctc gaa ttt ttg aat gac tta aaa tca gga aag gac agc 432
Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser
130 135 140

gta aag gcc ccg gtg tat tcc cat cta acc tat gac cgc gag gaa ggt 480
Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly
145 150 155 160

gtg ttc gag gtt gta gaa cag gcg gat att gtg att att gaa ggc att 528
Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile Ile Glu Gly Ile
165 170 175

aat gtt ctt cag tcg ccc acc ttg gag gat gac cgg gaa aac ccg cgt 576
Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg
180 185 190

att ttt gtt tcc gat ttc ttt gat ttt tcg att tat gtg gat gcg gag 624
Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu
195 200 205

gaa agc cgg att ttc act tgg tat tta gag cgt ttt cgc ctg ctt cgg 672
Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg
210 215 220




gaa aca gct ttt caa aat cct gat tca tat ttt cat aaa ttt aaa gac 720
Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp
225 230 235 240

ttg tcc gat cag gag gct gac gag atg gca gcc tcg att tgg gag agt 768
Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser Ile Trp Glu Ser
245 250 255

gtc aac cgg ccg aat tta tat gaa aat att ttg cca act aaa ttc agg 816
Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg
260 265 270

tca gat ctc att ttg cgt aag gga gac ggg cat aag gtc gag gaa gtg 864
Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val
275 280 285

ttg gta agg agg gta tga 882
Leu Val Arg Arg Val
290



<210> 22
<211> 846
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(843)

<400> 22

gtg gaa gga ttg aat gat tat cta tct gtt gaa gaa gtg gag acg atc 48
Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile
1 5 10 15

tat att ccg ctt gtt cgc ttg ctt cat tta cat gtc aag tct gcg gct 96
Tyr Ile Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala
20 25 30

gaa cgc aat aag cat gtc aat gtt ttt ttg aag cac cca cat tca gcc 144
Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala
35 40 45

aaa att ccg ttt att atc ggc att gcc ggc agt gtc gca gtc gga aaa 192
Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys
50 55 60

agc acg acg gcg cgg atc ttg cag aag ctg ctt tcg cgt ttg cct gac 240
Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp
65 70 75 80

cgt cca aaa gtg agc ctt atc acg aca gat ggt ttt tta ttt cct act 288
Arg Pro Lys Val Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr
85 90 95

gcc gag ctg aaa aag aaa aat atg atg tca aga aaa gga ttt cct gaa 336
Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu
100 105 110

agc tat gat gta aag gcg ctg ctc gaa ttt ttg aat gac tta aaa tca 384
Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser
115 120 125

gga aag gac agc gta aag gcc ccg gtg tat tcc cat cta acc tat gac 432
Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp
130 135 140

cgc gag gaa ggt gtg ttc gag gtt gta gaa cag gcg gat att gtg att 480
Arg Glu Glu Gly Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile
145 150 155 160

att gaa ggc att aat gtt ctt cag tcg ccc acc ttg gag gat gac cgg 528
Ile Glu Gly Ile Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg
165 170 175

gaa aac ccg cgt att ttt gtt tcc gat ttc ttt gat ttt tcg att tat 576
Glu Asn Pro Arg Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr
180 185 190

gtg gat gcg gag gaa agc cgg att ttc act tgg tat tta gag cgt ttt 624
Val Asp Ala Glu Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe
195 200 205

cgc ctg ctt cgg gaa aca gct ttt caa aat cct gat tca tat ttt cat 672
Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His
210 215 220

aaa ttt aaa gac ttg tcc gat cag gag gct gac gag atg gca gcc tcg 720
Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser
225 230 235 240

att tgg gag agt gtc aac cgg ccg aat tta tat gaa aat att ttg cca 768
Ile Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro
245 250 255

act aaa ttc agg tca gat ctc att ttg cgt aag gga gac ggg cat aag 816
Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys
260 265 270

gtc gag gaa gtg ttg gta agg agg gta tga 846
Val Glu Glu Val Leu Val Arg Arg Val
275 280



<210> 23
<211> 831
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(831)

<400> 23

atg aaa aca aaa ctg gat ttt cta aaa atg aag gag tct gaa gaa ccg 48
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro
1 5 10 15

att gtc atg ctg acc gct tat gat tat ccg gca gct aaa ctt gct gaa 96
Ile Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu
20 25 30

caa gcg gga gtt gac atg att tta gtc ggt gat tca ctt gga atg gtc 144
Gln Ala Gly Val Asp Met Ile Leu Val Gly Asp Ser Leu Gly Met Val
35 40 45

gtc ctc ggc ctt gat tca act gtc ggt gtg aca gtt gcg gac atg atc 192
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met Ile
50 55 60

cat cat aca aaa gcc gtt aaa agg ggt gcg ccg aat acc ttt att gtg 240
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe Ile Val
65 70 75 80

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 288
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu
85 90 95

aaa aat gca gcg gct atc gtt cag gaa agc gga gct gac gca ctg aag 336
Lys Asn Ala Ala Ala Ile Val Gln Glu Ser Gly Ala Asp Ala Leu Lys
100 105 110

ctt gag ggc gga gaa ggc gtg ttt gaa tcc att cgc gca ttg acg ctt 384
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser Ile Arg Ala Leu Thr Leu
115 120 125

gga ggc att cca gta gtc agt cac tta ggt ttg aca ccg cag tca gtc 432
Gly Gly Ile Pro Val Val Ser His Leu Gly Leu Thr Pro Gln Ser Val
130 135 140

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa agc gcc 480
Gly Val Leu Gly Gly Tyr Lys Val Gln Gly Lys Asp Glu Gln Ser Ala
145 150 155 160

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga gct atg 528
Lys Lys Leu Ile Glu Asp Ser Ile Lys Cys Glu Glu Ala Gly Ala Met
165 170 175

atg ctt gtg ctg gaa tgt gtg ccg gca gaa ctc aca gcc aaa att gcc 576
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys Ile Ala
180 185 190

gag acg cta agc ata ccg gtc att gga atc ggg gct ggt gtg aaa gcg 624
Glu Thr Leu Ser Ile Pro Val Ile Gly Ile Gly Ala Gly Val Lys Ala
195 200 205

gac gga caa gtt ctc gtt tat cat gat att atc ggc cac ggt gtt gag 672
Asp Gly Gln Val Leu Val Tyr His Asp Ile Ile Gly His Gly Val Glu
210 215 220

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc atc 720
Arg Thr Pro Lys Phe Val Lys Gln Tyr Thr Arg Ile Asp Glu Thr Ile
225 230 235 240

gaa aca gca atc agc gga tat gtt cag gat gta aga cat cgt gct ttc 768
Glu Thr Ala Ile Ser Gly Tyr Val Gln Asp Val Arg His Arg Ala Phe
245 250 255

cct gaa caa aag cat tcc ttt caa atg aac cag aca gtg ctt gac ggc 816
Pro Glu Gln Lys His Ser Phe Gln Met Asn Gln Thr Val Leu Asp Gly
260 265 270

ttg tac ggg gga aaa 831
Leu Tyr Gly Gly Lys
275



<210> 24
<211> 277
<212> PRT

<213> Bacillus subtilis
<400> 24

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro
1 5 10 15

Ile Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu
20 25 30

Gln Ala Gly Val Asp Met Ile Leu Val Gly Asp Ser Leu Gly Met Val
35 40 45

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met Ile
50 55 60

His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe Ile Val
65 70 75 80

Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu
85 90 95

Lys Asn Ala Ala Ala Ile Val Gln Glu Ser Gly Ala Asp Ala Leu Lys
100 105 110

Leu Glu Gly Gly Glu Gly Val Phe Glu Ser Ile Arg Ala Leu Thr Leu
115 120 125

Gly Gly Ile Pro Val Val Ser His Leu Gly Leu Thr Pro Gln Ser Val
130 135 140

Gly Val Leu Gly Gly Tyr Lys Val Gln Gly Lys Asp Glu Gln Ser Ala
145 150 155 160

Lys Lys Leu Ile Glu Asp Ser Ile Lys Cys Glu Glu Ala Gly Ala Met
165 170 175

Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys Ile Ala
180 185 190

Glu Thr Leu Ser Ile Pro Val Ile Gly Ile Gly Ala Gly Val Lys Ala
195 200 205

Asp Gly Gln Val Leu Val Tyr His Asp Ile Ile Gly His Gly Val Glu
210 215 220

Arg Thr Pro Lys Phe Val Lys Gln Tyr Thr Arg Ile Asp Glu Thr Ile
225 230 235 240

Glu Thr Ala Ile Ser Gly Tyr Val Gln Asp Val Arg His Arg Ala Phe
245 250 255

Pro Glu Gln Lys His Ser Phe Gln Met Asn Gln Thr Val Leu Asp Gly
260 265 270

Leu Tyr Gly Gly Lys
275



<210> 25
<211> 858
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(858)

<400> 25

atg aga cag att act gat att tca cag ctg aaa gaa gcc ata aaa caa 48
Met Arg Gln Ile Thr Asp Ile Ser Gln Leu Lys Glu Ala Ile Lys Gln
1 5 10 15

tac cat tca gag ggc aag tca atc gga ttt gtt ccg acg atg ggg ttt 96
Tyr His Ser Glu Gly Lys Ser Ile Gly Phe Val Pro Thr Met Gly Phe
20 25 30

ctg cat gag ggg cat tta acc tta gca gac aaa gca aga caa gaa aac 144
Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg Gln Glu Asn
35 40 45

gac gcc gtt att atg agt att ttt gtg aat cct gca caa ttc ggc cct 192
Asp Ala Val Ile Met Ser Ile Phe Val Asn Pro Ala Gln Phe Gly Pro
50 55 60

aat gaa gat ttt gaa gca tat ccg cgc gat att gag cgg gat gca gct 240
Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp Ile Glu Arg Asp Ala Ala
65 70 75 80

ctt gca gaa aac gcc gga gtc gat att ctt ttt acg cca gat gct cat 288
Leu Ala Glu Asn Ala Gly Val Asp Ile Leu Phe Thr Pro Asp Ala His
85 90 95

gat atg tat ccc ggt gaa aag aat gtc acg att cat gta gaa aga cgc 336
Asp Met Tyr Pro Gly Glu Lys Asn Val Thr Ile His Val Glu Arg Arg
100 105 110

aca gac gtg tta tgc ggg cgc tca aga gaa gga cat ttt gac ggg gtc 384
Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val
115 120 125

gcg atc gta ctg acg aag ctt ttc aat cta gtc aag ccg act cgt gcc 432
Ala Ile Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala
130 135 140

tat ttc ggt tta aaa gat gcg cag cag gta gct gtt gtt gat ggg tta 480
Tyr Phe Gly Leu Lys Asp Ala Gln Gln Val Ala Val Val Asp Gly Leu
145 150 155 160

atc agc gac ttc ttc atg gat att gaa ttg gtt cct gtc gat acg gtc 528
Ile Ser Asp Phe Phe Met Asp Ile Glu Leu Val Pro Val Asp Thr Val
165 170 175

aga gag gaa gac ggc tta gcc aaa agc tct cgc aat gta tac tta aca 576
Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr
180 185 190

gct gag gaa aga aaa gaa gcg cct aag ctg tat cgg gcc ctt caa aca 624
Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gln Thr
195 200 205

agt gcg gaa ctt gtc caa gcc ggt gaa aga gat cct gaa gcg gtg ata 672
Ser Ala Glu Leu Val Gln Ala Gly Glu Arg Asp Pro Glu Ala Val Ile
210 215 220

aaa gct gca aaa gat atc att gaa acg act agc gga acc ata gac tat 720
Lys Ala Ala Lys Asp Ile Ile Glu Thr Thr Ser Gly Thr Ile Asp Tyr
225 230 235 240

gta gag ctt tat tcc tat ccg gaa ctc gag cct gtg aat gaa att gct 768
Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu Ile Ala
245 250 255

gga aag atg att ctc gct gtt gca gtt gct ttt tca aaa gcg cgt tta 816
Gly Lys Met Ile Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu
260 265 270

ata gat aat atc att att gat att cga gaa atg gag aga ata 858
Ile Asp Asn Ile Ile Ile Asp Ile Arg Glu Met Glu Arg Ile
275 280 285



<210> 26
<211> 286
<212> PRT

<213> Bacillus subtilis
<400> 26

Met Arg Gln Ile Thr Asp Ile Ser Gln Leu Lys Glu Ala Ile Lys Gln
1 5 10 15

Tyr His Ser Glu Gly Lys Ser Ile Gly Phe Val Pro Thr Met Gly Phe
20 25 30

Leu His Glu Gly His Leu Thr Leu Ala Asp Lys Ala Arg Gln Glu Asn
35 40 45

Asp Ala Val Ile Met Ser Ile Phe Val Asn Pro Ala Gln Phe Gly Pro
50 55 60

Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp Ile Glu Arg Asp Ala Ala
65 70 75 80

Leu Ala Glu Asn Ala Gly Val Asp Ile Leu Phe Thr Pro Asp Ala His
85 90 95

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr Ile His Val Glu Arg Arg
100 105 110

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val
115 120 125

Ala Ile Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala
130 135 140

Tyr Phe Gly Leu Lys Asp Ala Gln Gln Val Ala Val Val Asp Gly Leu
145 150 155 160

Ile Ser Asp Phe Phe Met Asp Ile Glu Leu Val Pro Val Asp Thr Val
165 170 175

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr
180 185 190

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gln Thr
195 200 205

Ser Ala Glu Leu Val Gln Ala Gly Glu Arg Asp Pro Glu Ala Val Ile
210 215 220

Lys Ala Ala Lys Asp Ile Ile Glu Thr Thr Ser Gly Thr Ile Asp Tyr
225 230 235 240

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu Ile Ala
245 250 255

Gly Lys Met Ile Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu
260 265 270

Ile Asp Asn Ile Ile Ile Asp Ile Arg Glu Met Glu Arg Ile
275 280 285



<210> 27
<211> 381
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(381)

<400> 27

atg tat cga aca atg atg agc ggc aaa ctt cac agg gca act gtt acg 48
Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr
1 5 10 15

gaa gca aac ctg aac tat gtg gga agc att aca att gat gaa gat ctc 96
Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr Ile Asp Glu Asp Leu
20 25 30

att gat gct gtg gga atg ctt cct aat gaa aaa gta caa att gtg aat 144
Ile Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gln Ile Val Asn
35 40 45

aat aat aat gga gca cgt ctt gaa acg tat att att cct ggt aaa cgg 192
Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr Ile Ile Pro Gly Lys Arg
50 55 60

gga agc ggc gtc ata tgc tta aac ggt gca gcc gca cgc ctt gtg cag 240
Gly Ser Gly Val Ile Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gln
65 70 75 80

gaa gga gat aag gtc att att att tcc tac aaa atg atg tct gat caa 288
Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys Met Met Ser Asp Gln
85 90 95

gaa gcg gca agc cat gag ccg aaa gtg gct gtt ctg aat gat caa aac 336
Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp Gln Asn
100 105 110

aaa att gaa caa atg ctg ggg aac gaa cca gcc cgt aca att ttg 381
Lys Ile Glu Gln Met Leu Gly Asn Glu Pro Ala Arg Thr Ile Leu
115 120 125



<210> 28
<211> 127
<212> PRT

<213> Bacillus subtilis
<400> 28

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr
1 5 10 15

Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr Ile Asp Glu Asp Leu
20 25 30

Ile Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gln Ile Val Asn
35 40 45

Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr Ile Ile Pro Gly Lys Arg
50 55 60

Gly Ser Gly Val Ile Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gln
65 70 75 80

Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys Met Met Ser Asp Gln
85 90 95

Glu Ala Ala Ser His Glu Pro Lys Val Ala Val Leu Asn Asp Gln Asn
100 105 110

Lys Ile Glu Gln Met Leu Gly Asn Glu Pro Ala Arg Thr Ile Leu
115 120 125



<210> 29
<211> 894
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(894)

<400> 29

atg aaa att gga att atc ggc gga ggc tcc gtt ggt ctt tta tgc gcc 48
Met Lys Ile Gly Ile Ile Gly Gly Gly Ser Val Gly Leu Leu Cys Ala
1 5 10 15

tat tat ttg tca ctt tat cac gac gtg act gtt gtg acg agg cgg caa 96
Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gln
20 25 30

gaa cag gct gcg gcc att cag tct gaa gga atc cgg ctt tat aaa ggc 144
Glu Gln Ala Ala Ala Ile Gln Ser Glu Gly Ile Arg Leu Tyr Lys Gly
35 40 45

ggg gag gaa ttc agg gct gat tgc agt gcg gac acg agt atc aat tcg 192
Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser Ile Asn Ser
50 55 60

gac ttt gac ctg ctt gtc gtg aca gtg aag cag cat cag ctt caa tct 240
Asp Phe Asp Leu Leu Val Val Thr Val Lys Gln His Gln Leu Gln Ser
65 70 75 80

gtt ttt tcg tcg ctt gaa cga atc ggg aag acg aat ata tta ttt ttg 288
Val Phe Ser Ser Leu Glu Arg Ile Gly Lys Thr Asn Ile Leu Phe Leu
85 90 95

caa aac ggc atg ggg cat atc cac gac cta aaa gac tgg cac gtt ggc 336
Gln Asn Gly Met Gly His Ile His Asp Leu Lys Asp Trp His Val Gly
100 105 110

cat tcc att tat gtt gga atc gtt gag cac gga gct gta aga aaa tcg 384
His Ser Ile Tyr Val Gly Ile Val Glu His Gly Ala Val Arg Lys Ser
115 120 125

gat aca gct gtt gat cat aca ggc cta ggt gcg ata aaa tgg agc gcg 432
Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala Ile Lys Trp Ser Ala
130 135 140

ttc gac gat gct gaa cca gac cgg ctg aac atc ttg ttt cag cat aac 480
Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn Ile Leu Phe Gln His Asn
145 150 155 160

cat tcg gat ttt ccg att tat tat gag acg gat tgg tac cgt ctg ctg 528
His Ser Asp Phe Pro Ile Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu
165 170 175

acg ggc aag ctg att gta aat gcg tgt att aat cct tta act gcg tta 576
Thr Gly Lys Leu Ile Val Asn Ala Cys Ile Asn Pro Leu Thr Ala Leu
180 185 190

ttg caa gtg aaa aat gga gaa ctg ctg aca acg cca gct tat ctg gct 624
Leu Gln Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala
195 200 205

ttt atg aag ctg gta ttt cag gag gca tgc cgc att tta aaa ctt gaa 672
Phe Met Lys Leu Val Phe Gln Glu Ala Cys Arg Ile Leu Lys Leu Glu
210 215 220

aat gaa gaa aag gct tgg gag cgg gtt cag gcc gtt tgt ggg caa acg 720
Asn Glu Glu Lys Ala Trp Glu Arg Val Gln Ala Val Cys Gly Gln Thr
225 230 235 240

aaa gag aat cgt tca tca atg ctg gtt gac gtc att gga ggc cgg cag 768
Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val Ile Gly Gly Arg Gln
245 250 255

acg gaa gct gac gcc att atc gga tac tta ttg aag gaa gca agt ctt 816
Thr Glu Ala Asp Ala Ile Ile Gly Tyr Leu Leu Lys Glu Ala Ser Leu
260 265 270

caa ggt ctt gat gcc gtc cac cta gag ttt tta tat ggc agc atc aaa 864
Gln Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser Ile Lys
275 280 285

gca ttg gag cga aat aca aac aaa gtc ttt 894
Ala Leu Glu Arg Asn Thr Asn Lys Val Phe
290 295



<210> 30
<211> 298
<212> PRT

<213> Bacillus subtilis

<400> 30

Met Lys Ile Gly Ile Ile Gly Gly Gly Ser Val Gly Leu Leu Cys Ala
1 5 10 15

Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gln
20 25 30

Glu Gln Ala Ala Ala Ile Gln Ser Glu Gly Ile Arg Leu Tyr Lys Gly
35 40 45

Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser Ile Asn Ser
50 55 60

Asp Phe Asp Leu Leu Val Val Thr Val Lys Gln His Gln Leu Gln Ser
65 70 75 80

Val Phe Ser Ser Leu Glu Arg Ile Gly Lys Thr Asn Ile Leu Phe Leu
85 90 95

Gln Asn Gly Met Gly His Ile His Asp Leu Lys Asp Trp His Val Gly
100 105 110

His Ser Ile Tyr Val Gly Ile Val Glu His Gly Ala Val Arg Lys Ser
115 120 125

Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala Ile Lys Trp Ser Ala
130 135 140

Phe Asp Asp Ala Glu Pro Asp Arg Leu Asn Ile Leu Phe Gln His Asn
145 150 155 160

His Ser Asp Phe Pro Ile Tyr Tyr Glu Thr Asp Trp Tyr Arg Leu Leu
165 170 175

Thr Gly Lys Leu Ile Val Asn Ala Cys Ile Asn Pro Leu Thr Ala Leu
180 185 190

Leu Gln Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala
195 200 205

Phe Met Lys Leu Val Phe Gln Glu Ala Cys Arg Ile Leu Lys Leu Glu
210 215 220

Asn Glu Glu Lys Ala Trp Glu Arg Val Gln Ala Val Cys Gly Gln Thr
225 230 235 240

Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val Ile Gly Gly Arg Gln
245 250 255

Thr Glu Ala Asp Ala Ile Ile Gly Tyr Leu Leu Lys Glu Ala Ser Leu
260 265 270

Gln Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser Ile Lys
275 280 285

Ala Leu Glu Arg Asn Thr Asn Lys Val Phe
290 295



<210> 31
<211> 1725
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(1722)

<400> 31

atg ggg act aat gta cag gtg gat tca gca tct gcc gaa tgt aca cag 48
Met Gly Thr Asn Val Gln Val Asp Ser Ala Ser Ala Glu Cys Thr Gln
1 5 10 15

acg atg agc gga gca tta atg ctg att gaa tca tta aaa aaa gag aaa 96
Thr Met Ser Gly Ala Leu Met Leu Ile Glu Ser Leu Lys Lys Glu Lys
20 25 30

gta gaa atg atc ttc ggt tat ccg ggc ggg gct gtg ctt ccg att tac 144
Val Glu Met Ile Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile Tyr
35 40 45

gat aag cta tac aat tca ggg ttg gta cat atc ctt ccc cgt cac gaa 192
Asp Lys Leu Tyr Asn Ser Gly Leu Val His Ile Leu Pro Arg His Glu
50 55 60

caa gga gca att cat gca gcg gag gga tac gca agg gtc tcc gga aaa 240
Gln Gly Ala Ile His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys
65 70 75 80

ccg ggt gtc gtc att gcc acg tca ggg ccg gga gcg aca aac ctt gtt 288
Pro Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
85 90 95

aca ggc ctt gct gat gcc atg att gat tca ttg ccg tta gtc gtc ttt 336
Thr Gly Leu Ala Asp Ala Met Ile Asp Ser Leu Pro Leu Val Val Phe
100 105 110

aca ggg cag gta gca acc tct gta atc ggg agc gat gca ttt cag gaa 384
Thr Gly Gln Val Ala Thr Ser Val Ile Gly Ser Asp Ala Phe Gln Glu
115 120 125

gca gac att tta ggg att acg atg cca gta aca aaa cac agc tac cag 432
Ala Asp Ile Leu Gly Ile Thr Met Pro Val Thr Lys His Ser Tyr Gln
130 135 140

gtt cgc cag ccg gaa gat ctg ccg cgc atc at-" aaa gaa gcg ttc cat 480
Val Arg Gln Pro Glu Asp Leu Pro Arg Ile Ile Lys Glu Ala Phe His
145 150 155 160

att gca aca act gga aga ccc gga cct gta ttg att gat att ccg aaa 528
Ile Ala Thr Thr Gly Arg Pro Gly Pro Val Leu Ile Asp Ile Pro Lys
165 170 175




gat gta gca aca att gaa gga gaa ttc agc tac gat cat gag atg aat 576
Asp Val Ala Thr Ile Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn
180 185 190



1 f. C 



ccg gga tac cag ccg aca aca gag ccg aat tat ttg cag atc cgc 624
Pro Gly Tyr Gln Pro Thr Thr Glu Pro Asn Tyr Leu Gln Ile Arg
195 200 205

aag ctt gtg gaa gcc gtg agc agt gcg aaa aaa ccg gtg atc ctg gcg 672
Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val Ile Leu Ala
210 215 220

ggt gcg ggc gta ctg cac gga aaa gcg tca gaa gaa tta aaa aat tat 720
Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr
225 230 235 240

gct gaa cag cag caa atc cct gtg gca cac acc ctt ttg ggg ctc gga 768
Ala Glu Gln Gln Gln Ile Pro Val Ala His Thr Leu Leu Gly Leu Gly
245 250 255

ggc ttc ccg gct gac cat ccg ctt ttc cta ggg atg gcg gga atg cac 816
Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His
260 265 270

ggt act tat aca gcc aat atg gcc ctt cat gaa tgt gat cta tta atc 864
Gly Thr Tyr Thr Ala Asn Met Ala Leu His Glu Cys Asp Leu Leu Ile
275 280 285

agt atc ggc gcc cgt ttt gat gac cgt gtc aca gga aac ctg aaa cac 912
Ser Ile Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His
290 295 300

ttt gcc aga aac gca aag ata gcc cac atc gat att gat cca gct gaa 960
Phe Ala Arg Asn Ala Lys Ile Ala His Ile Asp Ile Asp Pro Ala Glu
305 310 315 320

atc gga aaa atc atg aaa aca cag att cct gta gtc gga gac agc aaa 1008
Ile Gly Lys Ile Met Lys Thr Gln Ile Pro Val Val Gly Asp Ser Lys
325 330 335

att gtc ctg cag gag ctg atc aaa caa gac ggc aaa caa agc gat tca 1056
Ile Val Leu Gln Glu Leu Ile Lys Gln Asp Gly Lys Gln Ser Asp Ser
340 345 350

agc gaa tgg aaa aaa cag ctc gca gaa tgg aaa gaa gag tat ccg ctc 1104
Ser Glu Trp Lys Lys Gln Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu
355 360 365

tgg tat gta gat aat gaa gaa gaa ggt ttt aaa cct cag aaa ttg att 1152
Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gln Lys Leu Ile
370 375 380

gaa tat att cat caa ttt aca aaa gga gag gcc att gtc gca acg gat 1200
Glu Tyr Ile His Gln Phe Thr Lys Gly Glu Ala Ile Val Ala Thr Asp
385 390 395 400

gta ggc cag cat caa atg tgg tca gcg caa ttt tat ccg ttc caa aaa 1248
Val Gly Gln His Gln Met Trp Ser Ala Gln Phe Tyr Pro Phe Gln Lys
405 410 415

gca gat aaa tgg gtc acg tca ggc gga ctt gga acg atg gga ttc ggt 1296
Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly
420 425 430

ctt ccg gcg gcg atc ggc gca cag ctg gcc gaa aaa gat gct act gtt 1344
Leu Pro Ala Ala Ile Gly Ala Gln Leu Ala Glu Lys Asp Ala Thr Val
435 440 445

gtc gcg gtt gtc gga gac ggc gga ttc caa atg acg ctt caa gaa ctc 1392
Val Ala Val Val Gly Asp Gly Gly Phe Gln Met Thr Leu Gln Glu Leu
450 455 460

gat gtt att cgc gaa tta aat ctt ccg gtc aag gta gtg att tta aat 1440
Asp Val Ile Arg Glu Leu Asn Leu Pro Val Lys Val Val Ile Leu Asn
465 470 475 480

aac gct tgt ctc gga atg gtc aga cag tgg cag gaa att ttc tat gaa 1488
Asn Ala Cys Leu Gly Met Val Arg Gln Trp Gln Glu Ile Phe Tyr Glu
485 490 495

gaa cgt tat tca gaa tct aaa ttc gct tct cag cct gac ttc gtc aaa 1536
Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gln Pro Asp Phe Val Lys
500 505 510

ttg tcc gaa gca cac ggc att aaa ggc atc aga att tca tca gaa gcg 1584
Leu Ser Glu Ala Tyr Gly Ile Lys Gly Ile Arg Ile Ser Ser Glu Ala
515 520 525

gaa gca aag gaa aag ctg gaa gag gca tta aca tca aga gaa cct gtt 1632
Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val
530 535 540

gtc att gac gtg cgg gtt gcc agc gaa gaa aaa gta ttc ccg atg gtg 1680
Val Ile Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val
545 550 555 560

gct ccg ggg aaa ggg ctg cat gaa atg gtg ggg gcg aaa cct tga 1725
Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro
565 570



<210> 32
<211> 574
<212> PRT

<213> Bacillus subtilis
<400> 32

Met Gly Thr Asn Val Gln Val Asp Ser Ala Ser Ala Glu Cys Thr Gln
1 5 10 15

Thr Met Ser Gly Ala Leu Met Leu Ile Glu Ser Leu Lys Lys Glu Lys
20 25 30

Val Glu Met Ile Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro Ile Tyr
35 40 45

Asp Lys Leu Tyr Asn Ser Gly Leu Val His Ile Leu Pro Arg His Glu
50 55 60

Gln Gly Ala Ile His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys
65 70 75 80

Pro Gly Val Val Ile Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val
85 90 95

Thr Gly Leu Ala Asp Ala Met Ile Asp Ser Leu Pro Leu Val Val Phe
100 105 110

Thr Gly Gln Val Ala Thr Ser Val Ile Gly Ser Asp Ala Phe Gln Glu
115 120 125

Ala Asp Ile Leu Gly Ile Thr Met Pro Val Thr Lys His Ser Tyr Gln
130 135 140

Val Arg Gln Pro Glu Asp Leu Pro Arg Ile Ile Lys Glu Ala Phe His
145 150 155 160

Ile Ala Thr Thr Gly Arg Pro Gly Pro Val Leu Ile Asp Ile Pro Lys
165 170 175

Asp Val Ala Thr Ile Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn
180 185 190

Leu Pro Gly Tyr Gln Pro Thr Thr Glu Pro Asn Tyr Leu Gln Ile Arg
195 200 205

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val Ile Leu Ala
210 215 220

Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr
225 230 235 240

Ala Glu Gln Gln Gln Ile Pro Val Ala His Thr Leu Leu Gly Leu Gly
245 250 255

Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His
260 265 270

Gly Thr Tyr Thr Ala Asn Met Ala Leu His Glu Cys Asp Leu Leu Ile
275 280 285

Ser Ile Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His
290 295 300

Phe Ala Arg Asn Ala Lys Ile Ala His Ile Asp Ile Asp Pro Ala Glu
305 310 315 320

Ile Gly Lys Ile Met Lys Thr Gln Ile Pro Val Val Gly Asp Ser Lys
325 330 335

Ile Val Leu Gln Glu Leu Ile Lys Gln Asp Gly Lys Gln Ser Asp Ser
340 345 350

Ser Glu Trp Lys Lys Gln Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu
355 360 365

Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gln Lys Leu Ile
370 375 380

Glu Tyr Ile His Gln Phe Thr Lys Gly Glu Ala Ile Val Ala Thr Asp
385 390 395 400

Val Gly Gln His Gln Met Trp Ser Ala Gln Phe Tyr Pro Phe Gln Lys
405 410 415

Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly
420 425 430

Leu Pro Ala Ala Ile Gly Ala Gln Leu Ala Glu Lys Asp Ala Thr Val
435 440 445

Val Ala Val Val Gly Asp Gly Gly Phe Gln Met Thr Leu Gln Glu Leu
450 455 460

Asp Val Ile Arg Glu Leu Asn Leu Pro Val Lys Val Val Ile Leu Asn
465 470 475 480

Asn Ala Cys Leu Gly Met Val Arg Gln Trp Gln Glu Ile Phe Tyr Glu
485 490 495

Glu Arg Tyr Ser Glu Ser Lys Phe Ala Ser Gln Pro Asp Phe Val Lys
500 505 510

Leu Ser Glu Ala Tyr Gly Ile Lys Gly Ile Arg Ile Ser Ser Glu Ala
515 520 525

Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val
530 535 540

Val Ile Asp Val Arg Val Ala Ser Glu Glu Lys Val Phe Pro Met Val
545 550 555 560

Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro
565 570



<210> 33
<211> 525
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(522)

<400> 33

ttg aaa aga att atc aca ttg act gtg gtg aac cgc tcc ggg gtg tta 48
Met Lys Arg Ile Ile Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu
1 5 10 15

aac cgg atc acc ggt cta ttc aca aaa agg cat tac aac att gaa agc 96
Asn Arg Ile Thr Gly Leu Phe Thr Lys Arg His Tyr Asn Ile Glu Ser
20 25 30

att aca gtt gga cac aca gaa aca gcc ggc gtt tcc aga atc acc ttc 144
Ile Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg Ile Thr Phe
35 40 45

gtc gtt cat gtt gaa ggt gaa aat gat gtt gaa cag tta acg aaa cag 192
Val Val His Val Glu Gly Glu Asn Asp Val Glu Gln Leu Thr Lys Gln
50 55 60

ctc aac aaa cag att gat gtg ~tg aaa gtc aca gar atc aca aat caa 240
Leu Asn Lys Gln Ile Asp Val Leu Lys Val Thr Asp Ile Thr Asn Gln
65 70 75 80

tcg att gtc cag agg gag ctg gcc tta atc aag gtt gtc tcc gca cct 288
Ser Ile Val Gln Arg Glu Leu Ala Leu Ile Lys Val Val Ser Ala Pro
85 90 95

tca aca aga aca gag att aat gga atc ata gaa ccg ttt aga gcc tct 336
Ser Thr Arg Thr Glu Ile Asn Gly Ile Ile Glu Pro Phe Arg Ala Ser
100 105 110

gtc gtt gat gtc agc aga gac agc atc gtt gtt cag gtg aca ggt gaa 384
Val Val Asp Val Ser Arg Asp Ser Ile Val Val Gln Val Thr Gly Glu
115 120 125

tct aac aaa att gaa gcg ctt att gag tta tta aaa cct tat ggc att 432
Ser Asn Lys Ile Glu Ala Leu Ile Glu Leu Leu Lys Pro Tyr Gly Ile
130 135 140

aaa gaa atc gcg aga aca ggt aca acg gct ttt gcg agg gga acc agc 480
Lys Glu Ile Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser
145 150 155 160

aaa agg cgt cat cca ata aaa caa tat cta ttg tat aaa aca taa 525
Lys Arg Arg His Pro Ile Lys Gln Tyr Leu Leu Tyr Lys Thr
165 170



<210> 34 

<211> 174
<212> PRT
<213> Bacillus subtilis

<400> 34

Met Lys Arg Ile Ile Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu
1 5 10 15

Asn Arg Ile Thr Gly Leu Phe Thr Lys Arg His Tyr Asn Ile Glu Ser
20 25 30

Ile Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg Ile Thr Phe
35 40 45

Val Val His Val Glu Gly Glu Asn Asp Val Glu Gln Leu Thr Lys Gln
50 55 60

Leu Asn Lys Gln Ile Asp Val Leu Lys Val Thr Asp Ile Thr Asn Gln
65 70 75 80

Ser Ile Val Gln Arg Glu Leu Ala Leu Ile Lys Val Val Ser Ala Pro
85 90 95

Ser Thr Arg Thr Glu Ile Asn Gly Ile Ile Glu Pro Phe Arg Ala Ser
100 105 110

Val Val Asp Val Ser Arg Asp Ser Ile Val Val Gln Val Thr Gly Glu
115 120 125

Ser Asn Lys Ile Glu Ala Leu Ile Glu Leu Leu Lys Pro Tyr Gly Ile
130 135 140

Lys Glu Ile Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser
145 150 155 160

Lys Arg Arg His Pro Ile Lys Gln Tyr Leu Leu Tyr Lys Thr
165 170



<210> 35
<211> 1029
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(1026)

<400> 35

atg gta aaa gta tat tat aac ggt gat atc aaa gag aac gta ttg gct 48
Met Val Lys Val Tyr Tyr Asn Gly Asp Ile Lys Glu Asn Val Leu Ala
1 5 10 15

gga aaa aca gta gcg gtt atc ggg tac ggt tcg caa ggc cac gca cat 96
Gly Lys Thr Val Ala Val Ile Gly Tyr Gly Ser Gln Gly His Ala His
20 25 30

gcc ctg aac ctt aaa gaa agc gga gta gac gtg atc gtc ggt gtt aga 144
Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val Ile Val Gly Val Arg
35 40 45

caa gga aaa tct ttc act caa gcc caa gaa gac gga cat aaa gta ttt 192
Gln Gly Lys Ser Phe Thr Gln Ala Gln Glu Asp Gly His Lys Val Phe
50 55 60

tca gta aaa gaa gcg gca gcc caa gcc gaa atc atc atg gtt ctg ctt 240
Ser Val Lys Glu Ala Ala Ala Gln Ala Glu Ile Ile Met Val Leu Leu
65 70 75 80

ccg gat gag cag cag caa aaa gta tac gaa gct gaa atc aaa gat gaa 288
Pro Asp Glu Gln Gln Gln Lys Val Tyr Glu Ala Glu Ile Lys Asp Glu
85 90 95

ttg aca gca gga aaa tca tta gta ttc gct cat gga ttt aac gtg cat 336
Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His
100 105 110

ttc cat caa att gtt cct ccg gcg gat gta gat gta ttc tta gtg gcc 384
Phe His Gln Ile Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala
115 120 125

cct aaa ggc ccg gga cac ttg gta aga aga aca tat gag caa gga gct 432
Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gln Gly Ala
130 135 140

ggc gta cct gca ttg ttc gca atc tat caa gat gtg act gga gaa gca 480
Gly Val Pro Ala Leu Phe Ala Ile Tyr Gln Asp Val Thr Gly Glu Ala
145 150 155 160

aga gac aaa gcc ctc gct tat gct aaa gga atc ggc ggc gca aga gcg 528
Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly Ile Gly Gly Ala Arg Ala
165 170 175

ggc gta tta gaa acg aca ttt aaa gaa gaa aca gaa aca gat ttg ttc 576
Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe
180 185 190

ggc gag caa gca gtt ctt tgc ggc gga cta agc gcg ctt gtc aaa gcc 624
Gly Glu Gln Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala
195 200 205

gga ttt gaa acc tta act gaa gca ggc tat cag cct gaa ctc gca tac 672
Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gln Pro Glu Leu Ala Tyr
210 215 220

ttc gag tgt ctt cat gag ctg aaa tta atc gta gac ctt atg tac gaa 720
Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu
225 230 235 240

gaa gga ctt gca gga atg aga tat tca atc tct gac aca gca cag tgg 768
Glu Gly Leu Ala Gly Met Arg Tyr Ser Ile Ser Asp Thr Ala Gln Trp
245 250 255

gga gat ttc gta tca ggc cct cgc gtc gtg gac gcc aaa gta aaa gaa 816
Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu
260 265 270

tcc atg aaa gaa gta tta aaa gat atc caa aac ggt aca ttc gca aaa 864
Ser Met Lys Glu Val Leu Lys Asp Ile Gln Asn Gly Thr Phe Ala Lys
275 280 285

gag tgg atc gtc gaa aac caa gta aac cgt cct cgc ttc aac gct atc 912
Glu Trp Ile Val Glu Asn Gln Val Asn Arg Pro Arg Phe Asn Ala Ile
290 295 300

aac gca agc gag aac gaa cat caa atc gaa gta gtg gga aga aag ctt 960
Asn Ala Ser Glu Asn Glu His Gln Ile Glu Val Val Gly Arg Lys Leu
305 310 315 320

cgt gaa atg atg ccg ttt gtg aaa caa ggc aag aag aag gaa gcg gtg 1008
Arg Glu Met Met Pro Phe Val Lys Gln Gly Lys Lys Lys Glu Ala Val
325 330 335

gtc tcc gtt gcg caa aat taa 1029
Val Ser Val Ala Gln Asn
340



<210> 36 

<211> 342
<212> PRT
<213> Bacillus subtilis

<400> 36

Met Val Lys Val Tyr Tyr Asn Gly Asp Ile Lys Glu Asn Val Leu Ala
1 5 10 15

Gly Lys Thr Val Ala Val Ile Gly Tyr Gly Ser Gln Gly His Ala His
20 25 30

Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val Ile Val Gly Val Arg
35 40 45

Gln Gly Lys Ser Phe Thr Gln Ala Gln Glu Asp Gly His Lys Val Phe
50 55 60

Ser Val Lys Glu Ala Ala Ala Gln Ala Glu Ile Ile Met Val Leu Leu
65 70 75 80

Pro Asp Glu Gln Gln Gln Lys Val Tyr Glu Ala Glu Ile Lys Asp Glu
85 90 95

Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His
100 105 110

Phe His Gln Ile Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala
115 120 125

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gln Gly Ala
130 135 140

Gly Val Pro Ala Leu Phe Ala Ile Tyr Gln Asp Val Thr Gly Glu Ala
145 150 155 160

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly Ile Gly Gly Ala Arg Ala
165 170 175

Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe
180 185 190

Gly Glu Gln Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala
195 200 205

Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gln Pro Glu Leu Ala Tyr
210 215 220

Phe Glu Cys Leu His Glu Leu Lys Leu Ile Val Asp Leu Met Tyr Glu
225 230 235 240

Glu Gly Leu Ala Gly Met Arg Tyr Ser Ile Ser Asp Thr Ala Gln Trp
245 250 255

Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu
260 265 270

Ser Met Lys Glu Val Leu Lys Asp Ile Gln Asn Gly Thr Phe Ala Lys
275 280 285

Glu Trp Ile Val Glu Asn Gln Val Asn Arg Pro Arg Phe Asn Ala Ile
290 295 300

Asn Ala Ser Glu Asn Glu His Gln Ile Glu Val Val Gly Arg Lys Leu
305 310 315 320

Arg Glu Met Met Pro Phe Val Lys Gln Gly Lys Lys Lys Glu Ala Val
325 330 335

Val Ser Val Ala Gln Asn
340

<220>
<221> CDS
<222> (1)..(1674)

<400> 37

atg gca gaa tta cgc agt aat atg atc aca caa gga atc gat aga gct 48
Met Ala Glu Leu Arg Ser Asn Met Ile Thr Gln Gly Ile Asp Arg Ala
1 5 10 15

cac cgc agt ttg ctt cgt gca gca ggg gta aaa gaa gag gat ttc 96
Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe
20 25 30

ggc aag ccg ttt att gcg gtg tgt aat tca tac att gat atc gtt ccc 144
Gly Lys Pro Phe Ile Ala Val Cys Asn Ser Tyr Ile Asp Ile Val Pro
35 40 45

ggt cat gtt cac ttg cag gag ttt ggg aaa atc gta aaa gaa gca atc 192
Gly His Val His Leu Gln Glu Phe Gly Lys Ile Val Lys Glu Ala Ile
50 55 60

aga gaa gca ggg ggc gtt ccg ttt gaa ttt aat acc att ggg gta gat 240
Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr Ile Gly Val Asp
65 70 75 80

gat ggc atc gca atg ggg cat atc ggt atg aga tat tcg ctg cca agc 288
Asp Gly Ile Ala Met Gly His Ile Gly Met Arg Tyr Ser Leu Pro Ser
85 90 95

cgt gaa att atc gca gac tct gtg gaa acg gtt gta tcc gca cac tgg 336
Arg Glu Ile Ile Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp
100 105 110

ttt gac gga atg gtc tgt att ccg aac tgc gac aaa atc aca ccg gga 384
Phe Asp Gly Met Val Cys Ile Pro Asn Cys Asp Lys Ile Thr Pro Gly
115 120 125

atg ctt atg gcg gca atg cgc atc aac att ccg acg att ttt gtc agc 432
Met Leu Met Ala Ala Met Arg Ile Asn Ile Pro Thr Ile Phe Val Ser
130 135 140

ggc gga ccg atg gcg gca gga aga aca agt tac ggg cga aaa atc tcc 480
Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys Ile Ser
145 150 155 160

ctt tcc tca gta ttc gaa ggg gta ggc gcc tac caa gca ggg aaa atc 528
Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gln Ala Gly Lys Ile
165 170 175

aac gaa aac gag ctt caa gaa cta gag cag ttc gga tgc cca acg tgc 576
Asn Glu Asn Glu Leu Gln Glu Leu Glu Gln Phe Gly Cys Pro Thr Cys
180 185 190

ggg tct tgc tca ggc atg ttt acg gcg aac tca atg aac tgt ctg tca 624
Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser
195 200 205

gaa gca ctt ggt ctt gct ttg ccg ggt aat gga acc att ctg gca aca 672
Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr Ile Leu Ala Thr
210 215 220

tct ccg gaa cgc aaa gag ttt gtg aga aaa tcg gct gcg caa tta atg 720
Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gln Leu Met
225 230 235 240

gaa acg att cgc aaa gat atc aaa ccg cgt gat att gtt aca gta aaa 768
Glu Thr Ile Arg Lys Asp Ile Lys Pro Arg Asp Ile Val Thr Val Lys
245 250 255

gcg att gat aac gcg ttt gca ctc gat atg gcg ctc gga ggt tct aca 816
Ala Ile Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr
260 265 270

aat acc gtt ctc cat acc ctt gcc ctt gca aac gaa gcc ggc gtt gaa 864
Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu
275 280 285

tac tct tta gaa cgc att aac gaa gtc gct gag cgc gtg ccg cac ttg 912
Tyr Ser Leu Glu Arg Ile Asn Glu Val Ala Glu Arg Val Pro His Leu
290 295 300

gct aag ctg gcg cct gca tcg gat gtg ttt att gaa gat ctt cac gaa 960
Ala Lys Leu Ala Pro Ala Ser Asp Val Phe Ile Glu Asp Leu His Glu
305 310 315 320

gcg ggc ggc gtt tca gcg gct ctg aat gag ctt tcg aag aaa gaa gga 1008
Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly
325 330 335

gcg ctt cat tta gat gcg ctg act gtt aca gga aaa act ctt gga gaa 1056
Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu
340 345 350

acc att gcc gga cat gaa gta aag gat tat gac gtc att cac ccg ctg 1104
Thr Ile Ala Gly His Glu Val Lys Asp Tyr Asp Val Ile His Pro Leu
355 360 365

gat caa cca ttc act gaa aag gga ggc ctt gct gtt tta ttc ggt aat 1152
Asp Gln Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn
370 375 380

cta gct ccg gac ggc gct atc att aaa aca ggc ggc gta cag aat ggg 1200
Leu Ala Pro Asp Gly Ala Ile Ile Lys Thr Gly Gly Val Gln Asn Gly
385 390 395 400

att aca aga cac gaa ggg ccg gct gtc gta ttc gac tct cag gac gag 1248
Ile Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gln Asp Glu
405 410 415

gcg ctt gac ggc att atc aac cga aaa gta aaa gaa ggc gac gtt gtc 1296
Ala Leu Asp Gly Ile Ile Asn Arg Lys Val Lys Glu Gly Asp Val Val
420 425 430

atc atc aga tac gaa ggg cca aaa ggc gga cct ggc atg ccg gaa atg 1344
Ile Ile Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met
435 440 445

ctg gcg cca aca tcc caa atc gtt gga atg gga ctc ggg cca aaa gtg 1392
Leu Ala Pro Thr Ser Gln Ile Val Gly Met Gly Leu Gly Pro Lys Val
450 455 460

gca ttg att acg gac gga cgt ttt tcc gga gcc tcc cgt ggc ctc tca 1440
Ala Leu Ile Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser
465 470 475 480

atc ggc cac gta tca cct gag gcc gct gag ggc ggg ccg ctt gcc ttt 1488
Ile Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe
485 490 495

gtt gaa aac gga gac cat att atc gtt gat att gaa aaa cgc atc ttg 1536
Val Glu Asn Gly Asp His Ile Ile Val Asp Ile Glu Lys Arg Ile Leu
500 505 510

gat gta caa gtg cca gaa gaa gag tgg gaa aaa cga aaa gcg aac tgg 1584
Asp Val Gln Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp
515 520 525

aaa ggt ttt gaa ccg aaa gtg aaa acc ggc tac ctg gca cgt tat tct 1632
Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser
530 535 540

aaa ctt gtg aca agt gcc aac acc ggc ggt att atg aaa atc 1674
Lys Leu Val Thr Ser Ala Asn Thr Gly Gly Ile Met Lys Ile
545 550 555



<210> 38
<211> 553
<212> PRT
<213> Bacillus subtilis

<400> 38

Met Ala Glu Leu Arg Ser Asn Met Ile Thr Gln Gly Ile Asp Arg Ala
1 5 10 15

Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe
20 25 30

Gly Lys Pro Phe Ile Ala Val Cys Asn Ser Tyr Ile Asp Ile Val Pro
35 40 45

Gly His Val His Leu Gln Glu Phe Gly Lys Ile Val Lys Glu Ala Ile
50 55 60

Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr Ile Gly Val Asp
65 70 75 80

Asp Gly Ile Ala Met Gly His Ile Gly Met Arg Tyr Ser Leu Pro Ser
85 90 95

Arg Glu Ile Ile Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp
100 105 110

Phe Asp Gly Met Val Cys Ile Pro Asn Cys Asp Lys Ile Thr Pro Gly
115 120 125

Met Leu Met Ala Ala Met Arg Ile Asn Ile Pro Thr Ile Phe Val Ser
130 135 140

Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys Ile Ser
145 150 155 160

Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gln Ala Gly Lys Ile
165 170 175

Asn Glu Asn Glu Leu Gln Glu Leu Glu Gln Phe Gly Cys Pro Thr Cys
180 185 190

Gly Ser Cys Ser Gly Met Phe Thr Ala Asn Ser Met Asn Cys Leu Ser
195 200 205

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr Ile Leu Ala Thr
210 215 220

Ser Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gln Leu Met
225 230 235 240

Glu Thr Ile Arg Lys Asp Ile Lys Pro Arg Asp Ile Val Thr Val Lys
245 250 255

Ala Ile Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr
260 265 270

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu
275 280 285

Tyr Ser Leu Glu Arg Ile Asn Glu Val Ala Glu Arg Val Pro His Leu
290 295 300

Ala Lys Leu Ala Pro Ala Ser Asp Val Phe Ile Glu Asp Leu His Glu
305 310 315 320

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly
325 330 335

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu
340 345 350

Thr Ile Ala Gly His Glu Val Lys Asp Tyr Asp Val Ile His Pro Leu
355 360 365

Asp Gln Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn
370 375 380

Leu Ala Pro Asp Gly Ala Ile Ile Lys Thr Gly Gly Val Gln Asn Gly
385 390 395 400

Ile Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gln Asp Glu
405 410 415

Ala Leu Asp Gly Ile Ile Asn Arg Lys Val Lys Glu Gly Asp Val Val
420 425 430

Ile Ile Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met
435 440 445

Leu Ala Pro Thr Ser Gln Ile Val Gly Met Gly Leu Gly Pro Lys Val
450 455 460

Ala Leu Ile Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser
465 470 475 480

Ile Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly Pro Leu Ala Phe
485 490 495

Val Glu Asn Gly Asp His Ile Ile Val Asp Ile Glu Lys Arg Ile Leu
500 505 510

Asp Val Gln Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp
515 520 525

Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser
530 535 540

Lys Leu Val Thr Ser Ala Asn Thr Gly Gly Ile Met Lys Ile
545 550 555



<210> 39
<211> 194
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: promoter
sequence

<220>
<221> -35_signal
<222> (136)..(141)

<220>
<221> -10_signal
<222> (159)..(164)

<400> 39

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60
tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120
ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180
gcaaccccgc ctgt 194



<210> 40
<211> 163
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: promoter
sequence

<220>
<221> -35_signal
<222> (113)..(118)

<220>
<221> -10_signal
<222> (136)..(141)

<400> 40

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60
nacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120
atcnacaagg tgtggtataa taatctcaac aacagcagga cgc 163



<210> 41
<211> 127
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: promoter
sequence

<220>
<221> -35_signal
<222> (34)..(39)

<220>
<221> -10_signal
<222> (58)..(63)

<220>
<221> -35_signal
<222> (75)..(80)

<220>
<221> -10_signal
<222> (98)..(103)

<400> 41

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120
tgcaatg 127



<210> 42
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 42

taaacatgag gaggagaaaa catg 24

<210> 43
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 43

attcgagaaa tggagagaat ataatatg 28

<210> 44
<211> 13
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 44

agaaaggagg tga 13

<210> 45
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 45

ttaagaaagg aggtgannnn atg 23

<210> 46
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 46

ttagaaagga ggtgannnnn atg 23

<210> 47
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 47

agaaaggagg tgannnnnnn atg 23

<210> 48
<211> 22
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 48

ajrtdaggagg tgannnnnna tg 22

<210> 49
<211> 25
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 49

ccctctagaa ggaggagaaa acatg 25

<210> 50
<211> 24
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 50

ccctctagag gaggagaaaa catg 24

<210> 51
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 51

ttagaaagga ggatttaaat atg 23

<210> 52
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 52

ttagaaagga ggtttaatta atg 23

<210> 53
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 53

ttagaaagga ggtgatttaa atg 23

<210> 54
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 54

ttagaaagga ggtgtttaaa atg 23

<210> 55
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 55

attcgagaaa ggaggtgaat ataatatg 28

<210> 56
<211> 27
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 56

attcgagaaa ggaggtgaat aataatg 27

<210> 57
<211> 28
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: ribosome
binding site

<400> 57

attcgtagaa aggaggtgaa ttaatatg 28



<210> 58
<211> 3291
<212> DNA
<213> Bacillus subtilis

<400> 58

atggggacta atgtacaggt ggattcagca tctgccgaat gtacacagac gatgagcgga 60

gcattaatgc tgattgaatc attaaaaaaa gagaaagtag aaatgatctt cggttatccg 120

ggcggggctg tgcttccgat ttacgataag ctatacaatt cagggttggt acatatcctt 180

ccccgtcacg aacaaggagc aattcatgca gcggagggat acgcaagggt ctccggaaaa 240

ccgggtgtcg tcattgccac gtcagggccg ggagcgacaa accttgttac aggccttgct 300

gatgccatga ttgattcatt gccgttagtc gtctttacag ggcaggtagc aacctctgta 360

atcgggagcg atgcatttca ggaagcagac attttaggga ttacgatgcc agtaacaaaa 420

cacagctacc aggttcgcca gccggaagat ctgccgcgca tcattaaaga agcgttccat 480

attgcaacaa ctggaagacc cggacctgta ttgattgata ttccgaaaga tgtagcaaca 540

attgaaggag aattcagcta cgatcatgag atgaatctcc cgggatacca gccgacaaca 600

gagccgaatt atttgcagat ccgcaagctt gtggaagccg tgagcagtgc gaaaaaaccg 660

gtgatcctgg cgggtgcggg cgtactgcac ggaaaagcgt cagaagaatt aaaaaattat 720

gctgaacagc agcaaatccc tgtggcacac accctnttgg ggctcggagg cttcccggct 780

gaccatccgc ttttcctagg gatggcggga atgcacggta cttatacagc caatatggcc 840

cttcatgaat gtgatctatt aatcagtatc ggcgcccgtt ttgatgaccg tgtcacagga 900

aacctgaaac actttgccag aaacgcaaag atagcccaca tcgatattga tccagctgaa 960

atcggaaaaa tcatgaaaac acagattcct gtagtcggag acagcaaaat tgtcctgcag 1020

gagctgatca aacaagacgg caaacaaagc gatccaagcg aatggaaaaa acagctcgca 1080

gaatggaaag aagagtatcc gctctggtat gtagataatg aagaagaagg ttttaaacct 1140

cagaaattga ttgaatatat tcatcaattt acaaaaggag aggccattgt cgcaacggat 1200

gtaggccagc atcaaatgcg gtcagcgcaa ttttatccgt tccaaaaagc agataaatgg 1260

gtcacgtcag gcggacttgg aacgatggga ttcggtcttc cggcggcgat cggcgcacag 1320

ctggccgaaa aagatgctac tgttgtcgcg gtcgtcggag acggcggatc ccaaacgacg 1380

cttcaagaac tcgatgttat tcgcgaatta aatcttccgg tcaaggtagc gattttaaat 1440

aacgcttgtc tcggaatggt cagacagtgg caggaaattt tctatgaaga acgttattca 1500

gaatctaaat tcgcttctca gcctgacttc gtcaaattgt ccgaagcata cggcattaaa 1560

ggcatcagaa tttcatcaga agcggaagca aaggaaaagc tggaagaggc attaacacca 1620

agagaacctg ttgtcattga cgtgcgggtt gccagcgaag aaaaagtatt cccgatggtg 1680

gctccgggga aagggctgca tgaaatggtg ggggtgaaac cttgaaaaga attatcacat 1740

tgactgtggt gaaccgctcc ggggtgttaa accggatcac cggtctattc acaaaaaggc 1800

attacaacat tgaaagcatt acagttggac acacagaaac agccggcgtt tccagaatca 1860

ccttcgtcgt tcatgttgaa ggtgaaaatg atgttgaaca gttaacgaaa cagctcaaca 1920

aacagattga tgtgctgaaa gtcacagaca tcacaaatca atcgattgtc cagagggagc 1980

tggccttaat caaggttgtc tccgcacctt caacaagaac agagattaat ggaatcatag 2040

aaccgtttag agcctctgtc gttgatgtca gcagagacag catcgttgtt caggtgacag 2100

gtgaatctaa caaaatcgaa gcgcttattg agttattaaa accttatggc attaaagaaa 2160

tcgcgagaac aggtacaacg gcttttgcga ggggaaccag caaaaggcgt catccaataa 2220

aacaatatct attgtataaa acataacaag ggagagattg aaatggtaaa agtatattat 2280

aacggtgata tcaaagagaa cgtattggct ggaaaaacag tagcggttat cgggtacggt 2340

ccgcaaggcc acgcacatgc cctgaacctt aaagaaagcg gagtagacgt gatcgtcggt 2400

gttagacaag gaaaatcttt cactcaagcc caagaagacg gacataaagt attttcagta 2460

aaagaagcgg cagcccaagc cgaaatcatc atggttctgc ttccggatga gcagcagcaa 2520

aaagtatacg aagctgaaat caaagatgaa ttgacagcag gaaaatcatt agtattcgct 2580

catggactta acgcgcattt ccatcaaatt gttcctccgg cggatgtaga tgtattctta 2640

gcggccccta aaggcccggg acacttggta agaagaacat atgagcaagg agctggcgta 2700

cctgcattgt tcgcaatcta tcaagatgtg actggagaag caagagacaa agccctcgct 2760

tatgctaaag gaatcggcgg cgcaagagcg ggcgtattag aaacgacatt taaagaagaa 2820

acagaaacag atttgttcgg tgagcaagca gttctttgcg gcggattaag cgcgcctgtc 2880

aaagccggat ttgaaacctt aactgaagca ggttatcagc ctgaacttgc atacttcgag 2940

tgtcttcatg agctgaaatt aatcgtagac cttatgtacg aagaaggact tgcaggaatg 3000

agacattcaa tctccgacac agcacagtgg ggagatttcg tatcaggccc tcgcgttgtg 3060

gacgccaaag taaaagaatc tatgaaagaa gtattaaaag atatccaaaa cggtacattc 3120

gcaaaagagt ggatcgtcga aaaccaagta aaccgtcctc gtttcaacgc tatcaatgca 3180

agcgagaacg aacatcaaat cgaagtagtg ggaagaaagc ttcgtgaaat gatgccgttt 3240

gtgaaacaag gcaagaagaa ggaagcggtg gtctccgttg cgcaaaatta a 3291



<210> 59
<211> 2363
<212> DNA
<213> Bacillus subtilis

<221> CDS
<222> (1939)..(2319)

<400> 59

ttggtacaag cccgttgatt ttggtatact tccattgggc agtatcgcct gcgaactgca 60

cctattatta aaatagatag acattgcagc agtctgcctt gatccaaaaa aggactggga 120

cagagggatg aaactcgccg aactttagaa agtgaagaat ccttctcgtt gtaacggaag 180

gtcttttggc ttgcagaaga aaacggcaga tcatctcctc taaacatgag gaggagaaaa 240

c atg aaa aca aaa ctg gat ttt cta aaa atg aag gag tct gaa gaa ccg 289
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro
1 5 10 15

att gtc atg ctg acc gct tat gat tat ccg gca gct aaa ctt gct gaa 337
Ile Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu
20 25 30

caa gcg gga gtt gac atg att tta gtc ggt gat tca ctt gga atg gtc 385
Gln Ala Gly Val Asp Met Ile Leu Val Gly Asp Ser Leu Gly Met Val
35 40 45

gtc ctc ggc ctt gat tca act gtc ggt gtg aca gtt gcg gac atg atc 433
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met Ile
50 55 60

cat cat aca aaa gcc gtt aaa agg ggt gcg ccg aat acc ttt att gtg 481
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe Ile Val
65 70 75 80

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 529
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu
85 90 95

aaa aat gca gcg gct atc gtt cag gaa agc gga gct gac gca ctg aag 577
Lys Asn Ala Ala Ala Ile Val Gln Glu Ser Gly Ala Asp Ala Leu Lys
100 105 110

ctt gag ggc gga gaa ggc gtg ttt gaa tcc att cgc gca ttg acg ctt 625
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser Ile Arg Ala Leu Thr Leu
115 120 125

gga ggc att cca gta gtc agt cac tta ggt ttg aca ccg cag tca gtc 673
Gly Gly Ile Pro Val Val Ser His Leu Gly Leu Thr Pro Gln Ser Val
130 135 140

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa agc gcc 721
Gly Val Leu Gly Gly Tyr Lys Val Gln Gly Lys Asp Glu Gln Ser Ala
145 150 155 160

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga gct atg 769
Lys Lys Leu Ile Glu Asp Ser Ile Lys Cys Glu Glu Ala Gly Ala Met
165 170 175

atg ctt gtg ctg gaa tgt gtg ccg gca gaa ctc aca gcc aaa att gcc 817
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys Ile Ala
180 185 190

gag acg cta agc ata ccg gtc att gga atc ggg gct ggt gtg aaa gcg 865
Glu Thr Leu Ser Ile Pro Val Ile Gly Ile Gly Ala Gly Val Lys Ala
195 200 205



gac gga caa gtt ctc gtt tat cat gat att atc ggc cat ggt gtt gag 913
Asp Gly Gln Val Leu Val Tyr His Asp Ile Ile Gly His Gly Val Glu
210 215 220

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc atc 961
Arg Thr Pro Lys Phe Val Lys Gln Tyr Thr Arg Ile Asp Glu Thr Ile
225 230 235 240

gaa aca gca atc agc gga tat gtt cag gat gta aga cat cgt gct ttc 1009
Glu Thr Ala Ile Ser Gly Tyr Val Gln Asp Val Arg His Arg Ala Phe
245 250 255

cct gaa caa aag cat tcc ttt caa atg aac cag aca gtg ctt gac ggc 1057
Pro Glu Gln Lys His Ser Phe Gln Met Asn Gln Thr Val Leu Asp Gly
260 265 270

ttg tac ggg gga aaa taag atg aga cag att act gat att tca cag ctg 1106
Leu Tyr Gly Gly Lys Met Arg Gln Ile Thr Asp Ile Ser Gln Leu
275 280 285

aaa gaa gcc ata aaa caa tac cat tca gag ggc aag tca atc gga ttt 1154
Lys Glu Ala Ile Lys Gln Tyr His Ser Glu Gly Lys Ser Ile Gly Phe
290 295 300

gtt ccg acg atg ggg ttt ctg cat gag ggg cat tta acc tta gca gac 1202
Val Pro Thr Met Gly Phe Leu His Glu Gly His Leu Thr Leu Ala Asp
305 310 315

aaa gca aga caa gaa aac gac gcc gtt att atg agt att ttt gtg aat 1250
Lys Ala Arg Gln Glu Asn Asp Ala Val Ile Met Ser Ile Phe Val Asn
320 325 330 335

cct gca caa ttc ggc cct aat gaa gat ttt gaa gca tat ccg cgc gat 1298
Pro Ala Gln Phe Gly Pro Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp
340 345 350

att gag cgg gat gca gct ctt gca gaa aac gcc gga gtc gat att ctt 1346
Ile Glu Arg Asp Ala Ala Leu Ala Glu Asn Ala Gly Val Asp Ile Leu
355 360 365

ttt acg cca gat gct cat gat atg tat ccc ggt gaa aag aat gtc acg 1394
Phe Thr Pro Asp Ala His Asp Met Tyr Pro Gly Glu Lys Asn Val Thr
370 375 380

att cat gta gaa aga cgc aca gac gtg tta tgc ggg cgc tca aga gaa 1442
Ile His Val Glu Arg Arg Thr Asp Val Leu Cys Gly Arg Ser Arg Glu
385 390 395

gga cat ttt gac ggg gtc gcg atc gta ctg acg aag ctt ttc aat cta 1490
Gly His Phe Asp Gly Val Ala Ile Val Leu Thr Lys Leu Phe Asn Leu
400 405 410 415

gtc aag ccg act cgt gcc tat ttc ggt tta aaa gat gcg cag cag gta 1538
Val Lys Pro Thr Arg Ala Tyr Phe Gly Leu Lys Asp Ala Gln Gln Val
420 425 430



get. gtt gtt gat ggg tta ate age gac ttc ttc atg gat att gaa ttg 
Ala Val Val Asp Gly Leu lie Ser Asp Phe Phe Met Asp lie Glu Leu 
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435 440 4 45 

gtt cct gtc gat acg gtc aga gag gaa gac ggc tta gcc aaa agc tct 1634
Val Pro Val Asp Thr Val Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser
450 455 460

cgc aat gta tac tta aca gct gag gaa aga aaa gaa gcg cct aag ctg 1682
Arg Asn Val Tyr Leu Thr Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu
465 470 475

tat cgg gcc ctt caa aca agt gcg gaa ctt gtc caa gcc ggt gaa aga 1730
Tyr Arg Ala Leu Gln Thr Ser Ala Glu Leu Val Gln Ala Gly Glu Arg
480 485 490 495

gat cct gaa gcg gtg ata aaa gct gca aaa gat atc att gaa acg act 1778
Asp Pro Glu Ala Val Ile Lys Ala Ala Lys Asp Ile Ile Glu Thr Thr
500 505 510

agc gga acc ata gac tat gta gag ctt tat tcc tat ccg gaa ctc gag 1826
Ser Gly Thr Ile Asp Tyr Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu
515 520 525

cct gtg aat gaa att gct gga aag atg att ctc gct gtt gca gtt gct 1874
Pro Val Asn Glu Ile Ala Gly Lys Met Ile Leu Ala Val Ala Val Ala
530 535 540

ttt tca aaa gcg cgt tta ata gat aat atc att att gat att cga gaa 1922
Phe Ser Lys Ala Arg Leu Ile Asp Asn Ile Ile Ile Asp Ile Arg Glu
545 550 555

atg gag aga ata taat atg tat cga aca atg atg agc ggc aaa ctt cac 1971
Met Glu Arg Ile      Met Tyr Arg Thr Met Met Ser Gly Lys Leu His
560 565 570

agg gca act gtt acg gaa gca aac ctg aac tat gtg gga agc att aca 2019
Arg Ala Thr Val Thr Glu Ala Asn Leu Asn Tyr Val Gly Ser Ile Thr
575 580 585 590

att gat gaa gat ctc att gat gct gtg gga atg ctt cct aat gaa aaa 2067
Ile Asp Glu Asp Leu Ile Asp Ala Val Gly Met Leu Pro Asn Glu Lys
595 600 605

gta caa att gtg aat aat aat aat gga gca cgt ctt gaa acg tat att 2115
Val Gln Ile Val Asn Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr Ile
610 615 620

att cct ggt aaa cgg gga agc ggc gtc ata tgc tta aac ggt gca gcc 2163
Ile Pro Gly Lys Arg Gly Ser Gly Val Ile Cys Leu Asn Gly Ala Ala
625 630 635

gca cgc ctt gtg cag gaa gga gat aag gtc att att att tcc tac aaa 2211
Ala Arg Leu Val Gln Glu Gly Asp Lys Val Ile Ile Ile Ser Tyr Lys
640 645 650

atg atg tct gat caa gaa gcg gca agc cat gag ccg aaa gtg gct gtt 2259
Met Met Ser Asp Gln Glu Ala Ala Ser His Glu Pro Lys Val Ala Val
655 660 665 670

ctg aat gat caa aac aaa att gaa caa atg ctg ggg aac gaa cca gcc 2307
Leu Asn Asp Gln Asn Lys Ile Glu Gln Met Leu Gly Asn Glu Pro Ala
675 680 685

cgt aca att ttg tagaagaaaa gcccccttta tcgggggttt tcttttaaga tttt 2363
Arg Thr Ile Leu
690



<210> 60 
<211> 293 
<212> PRT

<213> Bacillus subtilis 
<400> 60 

Met Ser Ile Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu
1 5 10 15

Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile Tyr Ile Pro Leu
20 25 30

Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys
35 40 45

His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys Ile Pro Phe
50 55 60

Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala
65 70 75 80

Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val
85 90 95

Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys
100 105 110

Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val
115 120 125

Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser
130 135 140

Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly
145 150 155 160

Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile Ile Glu Gly Ile
165 170 175

Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg
180 185 190

Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr Val Asp Ala Glu
195 200 205

Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg
210 215 220

Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp
225 230 235 240

Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser Ile Trp Glu Ser
245 250 255

Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro Thr Lys Phe Arg
260 265 270

Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val
275 280 285

Leu Val Arg Arg Val
290



<210> 61
<211> 281
<212> PRT
<213> Bacillus subtilis
<400> 61

Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr Ile
1 5 10 15

Tyr Ile Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala
20 25 30

Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala
35 40 45

Lys Ile Pro Phe Ile Ile Gly Ile Ala Gly Ser Val Ala Val Gly Lys
50 55 60

Ser Thr Thr Ala Arg Ile Leu Gln Lys Leu Leu Ser Arg Leu Pro Asp
65 70 75 80

Arg Pro Lys Val Ser Leu Ile Thr Thr Asp Gly Phe Leu Phe Pro Thr
85 90 95

Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu
100 105 110

Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser
115 120 125

Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp
130 135 140

Arg Glu Glu Gly Val Phe Glu Val Val Glu Gln Ala Asp Ile Val Ile
145 150 155 160

Ile Glu Gly Ile Asn Val Leu Gln Ser Pro Thr Leu Glu Asp Asp Arg
165 170 175

Glu Asn Pro Arg Ile Phe Val Ser Asp Phe Phe Asp Phe Ser Ile Tyr
180 185 190

Val Asp Ala Glu Glu Ser Arg Ile Phe Thr Trp Tyr Leu Glu Arg Phe
195 200 205

Arg Leu Leu Arg Glu Thr Ala Phe Gln Asn Pro Asp Ser Tyr Phe His
210 215 220

Lys Phe Lys Asp Leu Ser Asp Gln Glu Ala Asp Glu Met Ala Ala Ser
225 230 235 240

Ile Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn Ile Leu Pro
245 250 255

Thr Lys Phe Arg Ser Asp Leu Ile Leu Arg Lys Gly Asp Gly His Lys
260 265 270

Val Glu Glu Val Leu Val Arg Arg Val
275 280



<210> 62
<211> 1092
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(1089)

<400> 62

atg act aaa caa aca att cgc gtt gaa ttg aca tca aca aaa aaa ccg 48
Met Thr Lys Gln Thr Ile Arg Val Glu Leu Thr Ser Thr Lys Lys Pro
1 5 10 15

aaa cca gac cca aat cag ctt tcg ttc gga aga gtg ttt aca gac cac 96
Lys Pro Asp Pro Asn Gln Leu Ser Phe Gly Arg Val Phe Thr Asp His
20 25 30

atg ttt gta atg gac tat gcc gca gat aaa ggt tgg tac gat cca aga 144
Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg
35 40 45

atc att cct tat caa ccc tta tca atg gat cca act gca atg gtc tat 192
Ile Ile Pro Tyr Gln Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr
50 55 60

cac tac ggc caa acc gtg ttt gaa ggg tta aag gct tac gtg tca gag 240
His Tyr Gly Gln Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu
65 70 75 80

gat gac cat gtt ctg ctt ttc aga ccg gaa aaa aat atg gaa cgc ctg 288
Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu
85 90 95

aat caa tca aac gac cgc ctc tgc atc ccg caa att gat gaa gaa cag 336
Asn Gln Ser Asn Asp Arg Leu Cys Ile Pro Gln Ile Asp Glu Glu Gln
100 105 110

gtt ctt gaa ggc tta aag cag ctt gtc gca att gat aaa gac tgg att 384
Val Leu Glu Gly Leu Lys Gln Leu Val Ala Ile Asp Lys Asp Trp Ile
115 120 125

cca aat gcg gag ggc acg tcc ctt tac atc cgt ccg ttc atc atc gca 432
Pro Asn Ala Glu Gly Thr Ser Leu Tyr Ile Arg Pro Phe Ile Ile Ala
130 135 140

acc gag cct ttc ctt ggt gtt gcg gca tct cat acg tat aag ctc ttg 480
Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu
145 150 155 160



atc att ctt tct ccg gtc ggc tct tat tac aaa gaa ggc att aag ccg 528
Ile Ile Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly Ile Lys Pro
165 170 175

gtc aaa atc gct gtt gaa agt gaa ttt gtc cgt gcg gta aaa ggc gga 576
Val Lys Ile Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly
180 185 190

aca gga aat gcc aaa acc gca gga aac tat gct tca agc tta aaa gcg 624
Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala
195 200 205

cag cag gta gcc gaa gag aaa gga ttt tct caa gta ctc tgg ctg gac 672
Gln Gln Val Ala Glu Glu Lys Gly Phe Ser Gln Val Leu Trp Leu Asp
210 215 220

ggc att gag aag aaa tac atc gaa gaa gtc gga agc atg aac atc ttc 720
Gly Ile Glu Lys Lys Tyr Ile Glu Glu Val Gly Ser Met Asn Ile Phe
225 230 235 240



ttc aaa atc aac ggt gaa atc gta aca ccg atg ctg aac ggg agc atc 768
Phe Lys Ile Asn Gly Glu Ile Val Thr Pro Met Leu Asn Gly Ser Ile
245 250 255

ctg gaa ggc att acg cgc aat tca gtc atc gcc ttg ctt aag cat tgg 816
Leu Glu Gly Ile Thr Arg Asn Ser Val Ile Ala Leu Leu Lys His Trp
260 265 270

ggc ctt caa gtt tca gaa cga aaa att gcg atc gat gag gtc atc caa 864
Gly Leu Gln Val Ser Glu Arg Lys Ile Ala Ile Asp Glu Val Ile Gln
275 280 285

gcc cat aaa gac ggc atc ctg gaa gaa gcc ttc gga aca ggt aca gca 912
Ala His Lys Asp Gly Ile Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala
290 295 300

gct gtt att tcc cca gtc ggc gag ctg atc tgg cag gat gaa aca ctt 960
Ala Val Ile Ser Pro Val Gly Glu Leu Ile Trp Gln Asp Glu Thr Leu
305 310 315 320

tcg atc aac aac ggt gaa aca gga gaa atc gca aaa aaa cta tat gac 1008
Ser Ile Asn Asn Gly Glu Thr Gly Glu Ile Ala Lys Lys Leu Tyr Asp
325 330 335

acg att aca ggc att caa aaa ggc gct gtc gca gac gaa ttc gga tgg 1056
Thr Ile Thr Gly Ile Gln Lys Gly Ala Val Ala Asp Glu Phe Gly Trp
340 345 350

acg acc gaa gtc gca gcg ctg act gaa agc aag taa 1092
Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys
355 360



<210> 63
<211> 363
<212> PRT
<213> Bacillus subtilis
<400> 63



Met Thr Lys Gln Thr Ile Arg Val Glu Leu Thr Ser Thr Lys Lys Pro
1 5 10 15

Lys Pro Asp Pro Asn Gln Leu Ser Phe Gly Arg Val Phe Thr Asp His
20 25 30

Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg
35 40 45

Ile Ile Pro Tyr Gln Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr
50 55 60

His Tyr Gly Gln Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu
65 70 75 80

Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu
85 90 95

Asn Gln Ser Asn Asp Arg Leu Cys Ile Pro Gln Ile Asp Glu Glu Gln
100 105 110

Val Leu Glu Gly Leu Lys Gln Leu Val Ala Ile Asp Lys Asp Trp Ile
115 120 125

Pro Asn Ala Glu Gly Thr Ser Leu Tyr Ile Arg Pro Phe Ile Ile Ala
130 135 140

Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu
145 150 155 160

Ile Ile Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly Ile Lys Pro
165 170 175

Val Lys Ile Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly
180 185 190

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala
195 200 205

Gln Gln Val Ala Glu Glu Lys Gly Phe Ser Gln Val Leu Trp Leu Asp
210 215 220

Gly Ile Glu Lys Lys Tyr Ile Glu Glu Val Gly Ser Met Asn Ile Phe
225 230 235 240

Phe Lys Ile Asn Gly Glu Ile Val Thr Pro Met Leu Asn Gly Ser Ile
245 250 255

Leu Glu Gly Ile Thr Arg Asn Ser Val Ile Ala Leu Leu Lys His Trp
260 265 270

Gly Leu Gln Val Ser Glu Arg Lys Ile Ala Ile Asp Glu Val Ile Gln
275 280 285

Ala His Lys Asp Gly Ile Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala
290 295 300

Ala Val Ile Ser Pro Val Gly Glu Leu Ile Trp Gln Asp Glu Thr Leu
305 310 315 320

Ser Ile Asn Asn Gly Glu Thr Gly Glu Ile Ala Lys Lys Leu Tyr Asp
325 330 335

Thr Ile Thr Gly Ile Gln Lys Gly Ala Val Ala Asp Glu Phe Gly Trp
340 345 350

Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys
355 360



<210> 64
<211> 1071
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(1068)

<400> 64

ttg aat aag ct: att gaa cga gaa aaa act gta tat tat aag gaa aag 48
Met Asn Lys Leu Ile Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys
1 5 10 15

ccc gac ccg tct tcc ttg ggg ttt gga caa tat ttt aca gat tat atg 96
Pro Asp Pro Ser Ser Leu Gly Phe Gly Gln Tyr Phe Thr Asp Tyr Met
20 25 30

ttt gtg atg gac tac gaa gag ggg att gga tgg cat cat ccg aga att 144
Phe Val Met Asp Tyr Glu Glu Gly Ile Gly Trp His His Pro Arg Ile
35 40 45

gcg ccg tac gca ccg ctt acg ctt gat ccg tct tca tct gtt ttt cat 192
Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His
50 55 60

tac ggc cag gct gtt ttt gaa gga tta aaa gca tac aga aca gac gac 240
Tyr Gly Gln Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp
65 70 75 80

ggc agg gtg ctg ctg ttc cgt ccg gat caa aat atc aaa cgg ctg aac 288
Gly Arg Val Leu Leu Phe Arg Pro Asp Gln Asn Ile Lys Arg Leu Asn
85 90 95

aga tcg tgt gag cgc atg agc atg ccc cct tta gac gaa gag ctg gtg 336
Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val
100 105 110

ctt gag gca ttg acg caa tta gtt gag ctg gag aaa gat tgg gtt cca 384
Leu Glu Ala Leu Thr Gln Leu Val Glu Leu Glu Lys Asp Trp Val Pro
115 120 125

aag gaa aaa gga acg tca ctg tat att cgt cct ttt gtc att gcc aca 432
Lys Glu Lys Gly Thr Ser Leu Tyr Ile Arg Pro Phe Val Ile Ala Thr
130 135 140

gaa ccg agt ctc ggt gtg aag gca tcc agg agc tat aca ttt atg atc 480
Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met Ile
145 150 155 160

gtg ctt tcg cct gtc ggc tcc tat tat ggc gac gat cag ctg aag ccg 528
Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gln Leu Lys Pro
165 170 175

gtt aga atc tat gtc gaa gat gag tat gtg agg gcg gtc aac gga gga 576
Val Arg Ile Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly
180 185 190

gtc ggg cct gca aaa acg gct gga aac tat gcc gcc agt     cag gca 624
Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gln Ala
195 200 205

cag cgg aaa gcg aat gaa ctg ggc tat gac cag gta ctg tgg ctg gac 672
Gln Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gln Val Leu Trp Leu Asp
210 215 220

gcc atc gaa aag aaa tat gtg gaa gaa gta ggg agc atg aac atc ttt 720
Ala Ile Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn Ile Phe
225 230 235 240



ttc gtc ata aac ggg gaa gct gtc aca cct gct tta agc gga agc att 768
Phe Val Ile Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser Ile
245 250 255

tta agc ggg gtt aca cgt gcg tct gcg att gaa ttg att cga agc tgg 816
Leu Ser Gly Val Thr Arg Ala Ser Ala Ile Glu Leu Ile Arg Ser Trp
260 265 270

ggc att ccg gtt cgt gaa gag aga ata tcg att gat gag gtg tat gcg 864
Gly Ile Pro Val Arg Glu Glu Arg Ile Ser Ile Asp Glu Val Tyr Ala
275 280 285

gcc tct gca cgc gga gaa ttg aca gag gtc ttt ggc aca ggc acg gca 912
Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala
290 295 300

gca gtc gtt acg cct gtc ggt gaa ctc aac atc cat gga aaa acg gtg 960
Ala Val Val Thr Pro Val Gly Glu Leu Asn Ile His Gly Lys Thr Val
305 310 315 320

att gta ggc gac ggg caa atc ggg gac ctc tcg aaa aag ctg tat gaa 1008
Ile Val Gly Asp Gly Gln Ile Gly Asp Leu Ser Lys Lys Leu Tyr Glu
325 330 335

acg ata aca gat att cag ctt ggc aag gta aaa ggc ccg ttt aac tgg 1056
Thr Ile Thr Asp Ile Gln Leu Gly Lys Val Lys Gly Pro Phe Asn Trp
340 345 350

aca gtg gaa gtg tga 1071
Thr Val Glu Val
355



<210> 65
<211> 356
<212> PRT
<213> Bacillus subtilis
<400> 65

Met Asn Lys Leu Ile Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys
1 5 10 15

Pro Asp Pro Ser Ser Leu Gly Phe Gly Gln Tyr Phe Thr Asp Tyr Met
20 25 30

Phe Val Met Asp Tyr Glu Glu Gly Ile Gly Trp His His Pro Arg Ile
35 40 45

Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His
50 55 60

Tyr Gly Gln Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp
65 70 75 80

Gly Arg Val Leu Leu Phe Arg Pro Asp Gln Asn Ile Lys Arg Leu Asn
85 90 95

Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val
100 105 110

Leu Glu Ala Leu Thr Gln Leu Val Glu Leu Glu Lys Asp Trp Val Pro
115 120 125

Lys Glu Lys Gly Thr Ser Leu Tyr Ile Arg Pro Phe Val Ile Ala Thr
130 135 140

Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met Ile
145 150 155 160

Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gln Leu Lys Pro
165 170 175

Val Arg Ile Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly
180 185 190

Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gln Ala
195 200 205

Gln Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gln Val Leu Trp Leu Asp
210 215 220

Ala Ile Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn Ile Phe
225 230 235 240

Phe Val Ile Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser Ile
245 250 255

Leu Ser Gly Val Thr Arg Ala Ser Ala Ile Glu Leu Ile Arg Ser Trp
260 265 270

Gly Ile Pro Val Arg Glu Glu Arg Ile Ser Ile Asp Glu Val Tyr Ala
275 280 285

Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala
290 295 300

Ala Val Val Thr Pro Val Gly Glu Leu Asn Ile His Gly Lys Thr Val
305 310 315 320

Ile Val Gly Asp Gly Gln Ile Gly Asp Leu Ser Lys Lys Leu Tyr Glu
325 330 335

Thr Ile Thr Asp Ile Gln Leu Gly Lys Val Lys Gly Pro Phe Asn Trp
340 345 350

<210> 66
<211> 1428
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(1425)

<400> 66

ttg tta aac ggc caa aaa gaa tat cgc gtg gaa aaa gac ttc ctt ggg 48
Met Leu Asn Gly Gln Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly
1 5 10 15

gaa aaa caa att gaa gca gat gtt tat tac gga att cag acg ctc cgt 96
Glu Lys Gln Ile Glu Ala Asp Val Tyr Tyr Gly Ile Gln Thr Leu Arg
20 25 30

gcc tct gaa aat ttt ccg atc aca gga tac aaa atc cat gag gaa atg 144
Ala Ser Glu Asn Phe Pro Ile Thr Gly Tyr Lys Ile His Glu Glu Met
35 40 45

att aac gca ctg gcg att gtg aaa aaa gct gcg gct ctt gcc aac atg 192
Ile Asn Ala Leu Ala Ile Val Lys Lys Ala Ala Ala Leu Ala Asn Met
50 55 60

gac gtg aaa cgg ctg tat gaa gga att ggc caa gct atc gta caa gcc 240
Asp Val Lys Arg Leu Tyr Glu Gly Ile Gly Gln Ala Ile Val Gln Ala
65 70 75 80

gct gac gag att ctg gaa ggc aag tgg cac gat cag ttt atc gtc gat 288
Ala Asp Glu Ile Leu Glu Gly Lys Trp His Asp Gln Phe Ile Val Asp
85 90 95

ccg att cag ggc ggt gcc gga act tct atg aac atg aac gcg aat gag 336
Pro Ile Gln Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu
100 105 110

gtt atc gga aac cgg gcg ctt gaa atc atg gga cat aaa aag gga gat 384
Val Ile Gly Asn Arg Ala Leu Glu Ile Met Gly His Lys Lys Gly Asp
115 120 125

tat atc cat tta agt cca aac aca cat gtg aac atg tca cag tct cag 432
Tyr Ile His Leu Ser Pro Asn Thr His Val Asn Met Ser Gln Ser Gln
130 135 140

aac gat gtg ttc ccg act gct atc cat att tcc aca ttg aag ctc tta 480
Asn Asp Val Phe Pro Thr Ala Ile His Ile Ser Thr Leu Lys Leu Leu
145 150 155 160

gaa aaa ctg ctg aaa aca atg gaa gat atg cat agt gtg ttt aaa caa 528
Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gln
165 170 175

aaa gca cag gag ttt cac tct gtt att aaa atg ggc cgg aca cac ctt 576
Lys Ala Gln Glu Phe His Ser Val Ile Lys Met Gly Arg Thr His Leu
180 185 190

caa gat gcg gtt ccg atc cgt ctt ggc cag gaa ttc gaa gct tac agc 624
Gln Asp Ala Val Pro Ile Arg Leu Gly Gln Glu Phe Glu Ala Tyr Ser
195 200 205

cgt gtt ctc gag cgt gat atc aaa cga atc aag caa tcg cgc cag cac 672
Arg Val Leu Glu Arg Asp Ile Lys Arg Ile Lys Gln Ser Arg Gln His
210 215 220

ctg tat gaa gtc aac atg ggc gca act gct gtt ggt aca ggg ctg aac 720
Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn
225 230 235 240

gct gat cct gaa tat atc aaa cag gta gta aag cac ctt gct gat att 768
Ala Asp Pro Glu Tyr Ile Lys Gln Val Val Lys His Leu Ala Asp Ile
245 250 255

agc ggg ctt cct ctt gtc ggc gct gat cat ctt gtt gat gcg aca caa 816
Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gln
260 265 270

aat aca gat gcc tat aca gag gta tca gct tca tta aaa gtc tgc atg 864
Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met
275 280 285

atg aac atg tcg aag atc gca aac gac ctg cgc tta atg gcg tcg gga 912
Met Asn Met Ser Lys Ile Ala Asn Asp Leu Arg Leu Met Ala Ser Gly
290 295 300

ccg cgc gcc gga ctt gcg gaa att tct ctg cct gca cgt cag ccg ggt 960
Pro Arg Ala Gly Leu Ala Glu Ile Ser Leu Pro Ala Arg Gln Pro Gly
305 310 315 320

tca tct att atg ccg ggg aaa gtc aat ccg gtt atg gcg gag ctg atc 1008
Ser Ser Ile Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu Ile
325 330 335

aac caa att gcg ttc cag gtt atc gga aat gac aat aca atc tgc ctt 1056
Asn Gln Ile Ala Phe Gln Val Ile Gly Asn Asp Asn Thr Ile Cys Leu
340 345 350

gct tca gaa gcc ggc cag ctt gag ttg aac gtc atg gag ccc gtg ctt 1104
Ala Ser Glu Ala Gly Gln Leu Glu Leu Asn Val Met Glu Pro Val Leu
355 360 365

gtc ttt aat ttg ctt caa tcc atc agc atc atg aac aac ggc ttc cgt 1152
Val Phe Asn Leu Leu Gln Ser Ile Ser Ile Met Asn Asn Gly Phe Arg
370 375 380

tcg ttc act gac aac tgc tta aaa ggc att gaa gcc aac gaa aag cgt 1200
Ser Phe Thr Asp Asn Cys Leu Lys Gly Ile Glu Ala Asn Glu Lys Arg
385 390 395 400

atg aag caa tac gta gaa aaa agc gca ggc gtg atc aca gct gtc aat 1248
Met Lys Gln Tyr Val Glu Lys Ser Ala Gly Val Ile Thr Ala Val Asn
405 410 415

ccg cat ctt ggg tat gaa gcg gca gct aga att gc: agg gaa gca att 1296
Pro His Leu Gly Tyr Glu Ala Ala Ala Arg Ile Ala Arg Glu Ala Ile
420 425 430

atg aca ggg caa tct gtr :gg gat ctt tgt ttg cag cat gat gtg ctg 1344
Met Thr Gly Gln Ser Val Arg Asp Leu Cys Leu Gln His Asp Val Leu
435 440 445

act gaa gaa gaa ttg gat att att tta aac cca tat gag atg acc aaa 1392
Thr Glu Glu Glu Leu Asp Ile Ile Leu Asn Pro Tyr Glu Met Thr Lys
450 455 460

cca ggt atc gca ggg aaa gaa cta tta gaa aaa taa 1428
Pro Gly Ile Ala Gly Lys Glu Leu Leu Glu Lys
465 470 475



<210> 67
<211> 475
<212> PRT

<213> Bacillus subtilis
<400> 67

Met Leu Asn Gly Gln Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly
1 5 10 15

Glu Lys Gln Ile Glu Ala Asp Val Tyr Tyr Gly Ile Gln Thr Leu Arg
20 25 30

Ala Ser Glu Asn Phe Pro Ile Thr Gly Tyr Lys Ile His Glu Glu Met
35 40 45

Ile Asn Ala Leu Ala Ile Val Lys Lys Ala Ala Ala Leu Ala Asn Met
50 55 60

Asp Val Lys Arg Leu Tyr Glu Gly Ile Gly Gln Ala Ile Val Gln Ala
65 70 75 80

Ala Asp Glu Ile Leu Glu Gly Lys Trp His Asp Gln Phe Ile Val Asp
85 90 95

Pro Ile Gln Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu
100 105 110

Val Ile Gly Asn Arg Ala Leu Glu Ile Met Gly His Lys Lys Gly Asp
115 120 125

Tyr Ile His Leu Ser Pro Asn Thr His Val Asn Met Ser Gln Ser Gln
130 135 140

Asn Asp Val Phe Pro Thr Ala Ile His Ile Ser Thr Leu Lys Leu Leu
145 150 155 160

Glu Lys Leu Leu Lys Thr Met Glu Asp Met His Ser Val Phe Lys Gln
165 170 175

Lys Ala Gln Glu Phe His Ser Val Ile Lys Met Gly Arg Thr His Leu
180 185 190

Gln Asp Ala Val Pro Ile Arg Leu Gly Gln Glu Phe Glu Ala Tyr Ser
195 200 205

Arg Val Leu Glu Arg Asp Ile Lys Arg Ile Lys Gln Ser Arg Gln His
210 215 220

Leu Tyr Glu Val Asn Met Gly Ala Thr Ala Val Gly Thr Gly Leu Asn
225 230 235 240

Ala Asp Pro Glu Tyr Ile Lys Gln Val Val Lys His Leu Ala Asp Ile
245 250 255

Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gln
260 265 270

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met
275 280 285

Met Asn Met Ser Lys Ile Ala Asn Asp Leu Arg Leu Met Ala Ser Gly
290 295 300

Pro Arg Ala Gly Leu Ala Glu Ile Ser Leu Pro Ala Arg Gln Pro Gly
305 310 315 320

Ser Ser Ile Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu Ile
325 330 335

Asn Gln Ile Ala Phe Gln Val Ile Gly Asn Asp Asn Thr Ile Cys Leu
340 345 350

Ala Ser Glu Ala Gly Gln Leu Glu Leu Asn Val Met Glu Pro Val Leu
355 360 365

Val Phe Asn Leu Leu Gln Ser Ile Ser Ile Met Asn Asn Gly Phe Arg
370 375 380

Ser Phe Thr Asp Asn Cys Leu Lys Gly Ile Glu Ala Asn Glu Lys Arg
385 390 395 400

Met Lys Gln Tyr Val Glu Lys Ser Ala Gly Val Ile Thr Ala Val Asn
405 410 415

Pro His Leu Gly Tyr Glu Ala Ala Ala Arg Ile Ala Arg Glu Ala Ile
420 425 430

Met Thr Gly Gln Ser Val Arg Asp Leu Cys Leu Gln His Asp Val Leu
435 440 445

Thr Glu Glu Glu Leu Asp Ile Ile Leu Asn Pro Tyr Glu Met Thr Lys
450 455 460

Pro Gly Ile Ala Gly Lys Glu Leu Leu Glu Lys
465 470 475



<210> 68
<211> 768
<212> DNA
<213> Bacillus subtilis

<220>
<221> CDS
<222> (1)..(765)

<400> 68



atg aaa cga gaa agc aac att caa gtg ctc agc cgt ggt caa aaa gat 48
Met Lys Arg Glu Ser Asn Ile Gln Val Leu Ser Arg Gly Gln Lys Asp
1 5 10 15

cag cct gtg agc cag att tat caa gta tca aca atg act tct cta tta 96
Gln Pro Val Ser Gln Ile Tyr Gln Val Ser Thr Met Thr Ser Leu Leu
20 25 30

gac gga gta     gac gga gat ttt gaa ctg tca gag att ccg aaa tat 144
Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu Ile Pro Lys Tyr
35 40 45

gga gac ttc ggt atc gga acc ttt aac aag ctt gac gga gag ctg att 192
Gly Asp Phe Gly Ile Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu Ile
50 55 60

ggg ttt gac ggc gaa ttt tac cgt ctt cgc tca gac gga acc gcg aca 240
Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr
65 70 75 80

ccg gtc caa aat gga gac cgt tca ccg ttc tgt tca ttt acg ttc ttt 288
Pro Val Gln Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe
85 90 95

aca ccg gac atg acg cac aaa att gat gcg aaa atg aca cgc gaa gac 336
Thr Pro Asp Met Thr His Lys Ile Asp Ala Lys Met Thr Arg Glu Asp
100 105 110

ttt gaa aaa gag atc aac agc atg ctg cca agc aga aac tta ttt tat 384
Phe Glu Lys Glu Ile Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr
115 120 125

gca att cgc att gac gga ttg ttt aaa aag gtg cag aca aga aca gta 432
Ala Ile Arg Ile Asp Gly Leu Phe Lys Lys Val Gln Thr Arg Thr Val
130 135 140

gaa ctt caa gaa aaa cct tac gtg cca atg gtt gaa gcg gtc aaa aca 480
Glu Leu Gln Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr
145 150 155 160

cag ccg att ttc aac ttc gac aac gtg aga gga acg att gta ggt ttc 528
Gln Pro Ile Phe Asn Phe Asp Asn Val Arg Gly Thr Ile Val Gly Phe
165 170 175

ttg aca cca gct tat gca aac gga atc gcc gtt tct ggc tat cac ctg 576
Leu Thr Pro Ala Tyr Ala Asn Gly Ile Ala Val Ser Gly Tyr His Leu
180 185 190

cac ttc att gac gaa gga cgc aat tca ggc gga cac gtt ttt gac tat 624
His Phe Ile Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr
195 200 205

gtg ctt gag gat tgc acg gtt acg att tct caa aaa atg aac atg aat 672
Val Leu Glu Asp Cys Thr Val Thr Ile Ser Gln Lys Met Asn Met Asn
210 215 220

ctc aga ctt ccg aac aca gcg gat ttc ttt aat gcg aat ctg gat aac 720
Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn
225 230 235 240

cct gat ttt gcg aaa gat atc gaa aca act gaa gga agc cct gaa taa 768
Pro Asp Phe Ala Lys Asp Ile Glu Thr Thr Glu Gly Ser Pro Glu
245 250 255

<210> 69
<211> 255
<212> PRT
<213> Bacillus subtilis
<400> 69

Met Lys Arg Glu Ser Asn Ile Gln Val Leu Ser Arg Gly Gln Lys Asp
1 5 10 15

Gln Pro Val Ser Gln Ile Tyr Gln Val Ser Thr Met Thr Ser Leu Leu
20 25 30

Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu Ile Pro Lys Tyr
35 40 45

Gly Asp Phe Gly Ile Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu Ile
50 55 60

Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr
65 70 75 80

Pro Val Gln Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe
85 90 95

Thr Pro Asp Met Thr His Lys Ile Asp Ala Lys Met Thr Arg Glu Asp
100 105 110

Phe Glu Lys Glu Ile Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr
115 120 125

Ala Ile Arg Ile Asp Gly Leu Phe Lys Lys Val Gln Thr Arg Thr Val
130 135 140

Glu Leu Gln Glu Lys Pro Tyr Val Pro Met Val Glu Ala Val Lys Thr
145 150 155 160

Gln Pro Ile Phe Asn Phe Asp Asn Val Arg Gly Thr Ile Val Gly Phe
165 170 175

Leu Thr Pro Ala Tyr Ala Asn Gly Ile Ala Val Ser Gly Tyr His Leu
180 185 190

His Phe Ile Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr
195 200 205

Val Leu Glu Asp Cys Thr Val Thr Ile Ser Gln Lys Met Asn Met Asn
210 215 220

Leu Arg Leu Pro Asn Thr Ala Asp Phe Phe Asn Ala Asn Leu Asp Asn
225 230 235 240

Pro Asp Phe Ala Lys Asp Ile Glu Thr Thr Glu Gly Ser Pro Glu
245 250 255

<222> (1)..(1251)

<400> 70

atg aca ttc tcc ctt ttt ggt gac aaa ttt acc cgc cac tcc ggc att 48
Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly Ile
1 5 10 15

acg ctg ttg atg gaa gat ctg aac gac ggt tta cgc acg cct ggc gcg 96
Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala
20 25 30

att atg ctc ggc ggc ggt aat ccg gcg cag atc ccg gaa atg cag gac 144
Ile Met Leu Gly Gly Gly Asn Pro Ala Gln Ile Pro Glu Met Gln Asp
35 40 45

tac ttc cag acg cta ctg acc gac atg ctg gaa agt ggc aaa gcg act 192
Tyr Phe Gln Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr
50 55 60

gat gca ctg tgt aac tac gac ggt cca cag ggg aaa acg gag cta ctc 240
Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gln Gly Lys Thr Glu Leu Leu
65 70 75 80

aca ctg ctt gcc gga atg ctg cgc gag aag ttg ggt tgg gat atc gaa 288
Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp Ile Glu
85 90 95

cca cag aat att gca cta aca aac ggc agc cag agc gcg ttt ttc tac 336
Pro Gln Asn Ile Ala Leu Thr Asn Gly Ser Gln Ser Ala Phe Phe Tyr
100 105 110

tta ttt aac ctg ttt gcc gga cgc cgt gcc gat ggt cgg gtc aaa aaa 384
Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys
115 120 125

gtg ctg ttc ccg ctt gca ccg gaa tac att ggc tat gct gac gcc gga 432
Val Leu Phe Pro Leu Ala Pro Glu Tyr Ile Gly Tyr Ala Asp Ala Gly
130 135 140

ctg gaa gaa gat ctg ttt gtc tct gcg cgt ccg aat att gaa ctg ctg 480
Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn Ile Glu Leu Leu
145 150 155 160

ccg gaa ggc cag ttt aaa tac cac gtc gat ttt gag cat ctg cat att 528
Pro Glu Gly Gln Phe Lys Tyr His Val Asp Phe Glu His Leu His Ile
165 170 175

ggc gaa gaa acc ggg atg att tgc gtc tcc cgg ccg acg aat cca aca 576
Gly Glu Glu Thr Gly Met Ile Cys Val Ser Arg Pro Thr Asn Pro Thr
180 185 190

ggc aat gtg att act gac gaa gag ttg ctg aag ctt gac gcg ctg ggc 624
Gly Asn Val Ile Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly
195 200 205

aat caa cac ggc att ccg ctg gtg att gat aac gct tat ggc gtc ccg 672
Asn Gln His Gly Ile Pro Leu Val Ile Asp Asn Ala Tyr Gly Val Pro
210 215 220

ttc ccg ggt atc atc ttc agt gaa gcg cg: ccg cta tgg aat ccg aat 720
Phe Pro Gly Ile Ile Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn
225 230 235 240



atc gtg ctg tgc atg agt ctt tcc aag ctg ggt cta cct ggc tcc cgc 768
Ile Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg
245 250 255

tgc ggc att atc atc gcc aat gaa aaa atc atc acc gcc atc acc aat 816
Cys Gly Ile Ile Ile Ala Asn Glu Lys Ile Ile Thr Ala Ile Thr Asn
260 265 270

atg aac ggc att atc agc ctg gca cct ggc ggt att ggt ccg gcg atg 864
Met Asn Gly Ile Ile Ser Leu Ala Pro Gly Gly Ile Gly Pro Ala Met
275 280 285

atg tgt gaa atg att aag cgt aac gat ctg ctg cgc ctg tct gaa aca 912
Met Cys Glu Met Ile Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr
290 295 300

gtc atc aaa ccg ttt tac tac cag cgt gtt cag gaa act atc gcc atc 960
Val Ile Lys Pro Phe Tyr Tyr Gln Arg Val Gln Glu Thr Ile Ala Ile
305 310 315 320

att cgc cgc tat tta ccg gaa aat cgc tgc ctg att cat aaa ccg gaa 1008
Ile Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu Ile His Lys Pro Glu
325 330 335

gga gcc att ttc ctc tgg cta tgg ttt aag gat ttg ccc att acg acc 1056
Gly Ala Ile Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro Ile Thr Thr
340 345 350

aag cag ctc tat cag cgc ctg aaa gca cgc ggc gtg ctg atg gtg ccg 1104
Lys Gln Leu Tyr Gln Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro
355 360 365

ggg cac aac ttc ttc cca ggg ctg gat aaa ccg tgg ccg cat acg cat 1152
Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His
370 375 380

caa tgt atg cgc atg aac tac gta cca gag ccg gag aaa att gag gcg 1200
Gln Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys Ile Glu Ala
385 390 395 400

ggg gtg aag att ctg gcg gaa gag ata gaa aga gcc tgg gct gaa agt 1248
Gly Val Lys Ile Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser
405 410 415

cac taa 1254
His



<210> 71
<211> 417
<212> PRT
<213> Escherichia coli
<400> 71

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly Ile 
1 5 10 15 

Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 
20 25 30 

Ile Met Leu Gly Gly Gly Asn Pro Ala Gln Ile Pro Glu Met Gln Asp 
35 40 45 

Tyr Phe Gln Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gln Gly Lys Thr Glu Leu Leu 
65 70 75 80 

Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp Ile Glu 
85 90 95 

Pro Gln Asn Ile Ala Leu Thr Asn Gly Ser Gln Ser Ala Phe Phe Tyr 
100 105 110 

Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 125 

Val Leu Phe Pro Leu Ala Pro Glu Tyr Ile Gly Tyr Ala Asp Ala Gly 
130 135 140 

Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn Ile Glu Leu Leu 
145 150 155 160 

Pro Glu Gly Gln Phe Lys Tyr His Val Asp Phe Glu His Leu His Ile 
165 170 175 

Gly Glu Glu Thr Gly Met Ile Cys Val Ser Arg Pro Thr Asn Pro Thr 
180 185 190 

Gly Asn Val Ile Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

Asn Gln His Gly Ile Pro Leu Val Ile Asp Asn Ala Tyr Gly Val Pro 
210 215 220 

Phe Pro Gly Ile Ile Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

Ile Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 
245 250 255 

Cys Gly Ile Ile Ile Ala Asn Glu Lys Ile Ile Thr Ala Ile Thr Asn 
260 265 270 

Met Asn Gly Ile Ile Ser Leu Ala Pro Gly Gly Ile Gly Pro Ala Met 
275 280 285 

Met Cys Glu Met Ile Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

Val Ile Lys Pro Phe Tyr Tyr Gln Arg Val Gln Glu Thr Ile Ala Ile 
305 310 315 320 

Ile Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu Ile His Lys Pro Glu 
325 330 335 

Gly Ala Ile Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro Ile Thr Thr 
340 345 350 

Lys Gln Leu Tyr Gln Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 
355 360 365 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

Gln Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys Ile Glu Ala 
385 390 395 400 

Gly Val Lys Ile Leu Ala Glu Glu Ile Glu Arg Ala Trp Ala Glu Ser 
405 410 415 

His 

<210> 72 
<211> 8803 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN294 plasmid 

<400> 72 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 

ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 

gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 

tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360 

gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 

catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 

gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 

ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600 

ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 

gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 

gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 

cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 

aaaatcgcca caccgaagtc tagccigta? tccgcgacra aara^grrgr grtaggttac 900 

tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 

ggcccgattc agacggactt tttttccat~ gctgataaag gcggggacta cgccaaaaat 1020 

gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 

acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgratcag 1140 

ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc rcatgaaaaa ataargatag 1200 

aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260 

gattgtcgaa cgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 

cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 

ttaocccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 1440 

tttgcacctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 

tc:ccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560 

tt:cccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg accgattaaa 1620 

gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct toaagaagga 1680 

atgcttcagg aaaccgatga t'accgcatc ttacaggaag agcggtttac cgggtttttc 1740 

tcccgctatt cgattgctgt cggttcgaca ggaaatctag gtttaagcat cggcatcatc 1800 

ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860 

aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 

gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgac 1980 

gaacattctc gtcagctgtt c:taggatat gctgttgctg caagccgatt aaaaacacag 2040 

cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100 

ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatga: 2160 

gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca 2220 

ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280 

ggacttgccg tagggaggcc grcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340 

ggc:gttata cggtagagga caacacgctt tatacttcgc ttcatatgct ggctgtarc: 2400 

gaagataaat atttagagcc c:c:gctctt gctggcatgt tcgggccggt tcagcttttt 2460 

tcgacagaag agggaaggcg c:a:gc:cag aaata:aaga tggaacatgc cgtacatg:c 2520 

gtctggggaa cgggaggaag catggtccca aaagatgaaa tggctgcgta taaccgaa:c 2580 

ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgc 2640 

tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700 

aacitaacc: acatacttta tatacacagc acaatcggga gtcttggtct ggttttgggg 2760 

ggcatttgtc gattgctgta tctgaagaag aggcaaaagc tgtggaagga ttgaatgat: 2820 

atctatctgt tgaagaagtg gagacgatct atattccgct tgttcgcttg cttcatttac 2880 

atgtcaagtc tgcggctgaa cgcaa:aagc atgtcaatgt ttttttgaag cacccacatt 2940 

cagccaaaat tccgtttatt atcggcattg ccggcagtgt cgcagtcgga aaaagcacga 3000 

cggcgcggat cttgcagaag ctgcrttcgc gtttgcctga ccgtccaaaa gtgagcctta 3060 

tcacgacaga tggtttttta tttcc:actg ccgagctgaa aaagaaaaat atgatgtcaa 3120 

gaaaaggatt tcctgaaagc tatgatgtaa aggcgctgct cgaatttttg aatgacttaa 3180 

aatcaggaaa ggacagcgta aaggccccgg tgtattccca tctaacctat gaccgcgagg 3240 

aaggtgtgtt cgaggttgta gaacaggcgg atattgtgat tattgaaggc at:aatgttc 3300 

ttcagtcgcc caccttggag gatgaccggg aaaacccgcg tatttttgtt tccgatttct 3360 

ttgatttttc gatttatgtg gatgcggagg aaagccggat tttcacttgg tatttagagc 3420 

gttttcgcct gctrcgggaa acagcttttc aaaatcctga ttcatatttt cataaattta 3480 

aagacttgtc cgatcaggag gctgacgaga tggcagcctc gatttgggag agtgtcaacc 3540 

ggccgaattt atatgaaaat attttgccaa ctaaattcag gtcagatctc attttgcgta 3600 

agggagacgg gcataaggtc gaggaagtgt tggtaaggag ggtatgaaat gtgctgcagc 3660 

tcgagcaata gttaccctta ttatcaagat aagaaagaaa aggatttttc gctacgctca 3720 

aatcctttaa aaaaacacaa aagaccacat tttttaatgt ggtctttatt cttcaactaa 3780 

agcacccatt agttcaacaa acgaaaattg gataaagtgg gatattttta aaatatatat 3840 

ttatgttaca gtaatattga cttttaaaaa aggattgatt ctaatgaaga aagcagacaa 3900 

gtaagcctcc caaattcact ttagataaaa atttaggagg catatcaaat gaactttaat 3960 

aaaattgatt tagacaattg gaagagaaaa gagatattta atcattattt gaaccaacaa 4020 

acgactttta gtataaccac agaaattgat attagtgttt tataccgaaa cataaaacaa 4080 

gaaggatata aattttaccc tgcatttatt ttcttagtga caagggtgat aaactcaaat 4140 

acagctttta gaactggtta caatagcgac ggagagttag gttattggga taagttagag 4200 

ccactttata caatttttga tggtgtatct aaaacattct ctggtatttg gactcctgta 4260 

aagaatgact tcaaagagtt ttatgattta tacctttctg atgtagagaa atataatggt 4320 

tcggggaaat tgtttcccaa aacacctata cctgaaaatg ctttttctct ttctattatt 4380 

ccatggactt catttactgg gtttaactta aatatcaata ataatagtaa ttaccttcta 4440 

cccattatta cagcaggaaa attcattaat aaaggtaatt caatatattt accgctatct 4500 

ttacaggtac atcattctgt ttgtgatggt tatcatgcag gattgtttat gaactctatt 4560 

caggaattgt cagataggcc taatgactgg cttttataat atgagataat gccgactgta 4620 

ctttttacag tcggttttct aatgtcacta acctgccccg ttagttgaag aaggttttta 4680 

tattacagct gtcgactcgt gatcttcgga caggctgttc agctttttct caatgcgatc 4740 
cagctgcgct tttcggtttt tcgcatactt gaagcctgta acagccgcaa agacgacagc 4800 
ggcaaatata ataaatacaa acagctgaaa catcacatca cctatattca tgttcttcac 4860 
ctcatgtttg cgggagagat tcattctctt ccgtttttta tttaaagcgg cttttccaga 4920 
cgggaacggt gttttgtggt ctccattttc atttgccgat aggcgaacgc taaaaatggc 4980 
aggccgagca gggtaatgcc gctcaggaca gaaaaaatat aaatcggccg gccagcgcca 5040 
aacaggtcta tacatatccc cccgacccaa gggccgatga cgtttccgag ctgtggaaaa 5100 
ccgattgccc cgaaataagt gccttttaat cctggttttg caatctggtc tacatacaaa 5160 
tccatcatag agaataaaag cacttcgccg attgtaaatg tgatgacaat catcacaatt 5220 
gatggaacac cgtgtgatac ggtgaaaatg gccatgctga tgctaaccat cacattaccg 5280 
agcatcagag aacaaagcgg cgaaaaccgt tttgcaaaat ggacaatggg aaattgcgtc 5340 
gccaacacaa cgattgcgtt taatgtcagc atcagcccat acagcttcgt tccattgccg 5400 
atcaaggggt tctgcgccat atactgaggg aatgtggaac tgaattgtga gtagccgaag 5460 
gtgcatagcg taatgccgac caaagcaatg gtaaaaagat aatccttttg cgtgaccata 5520 
aacgcttccc gcacgctcat atttcgggac tgggctggtg ctgataagga tggatgtttt 5580 
ttaaattgga gggcaagcac aattccgtat agtccgtaaa tgactgcagg caccaaaaag 5640 
ggcgtagtcg attgcgatga gccgaaatat aggccaagca caggtccgaa gacaacgccg 5700 
atattaatag ccgcatagcg taaattaaaa actagcagtc tcgttttttc ttctgtcata 5760 
tcagacaaca aggcctttga agcgggctca aacagtgatt tgcaaagacc gtttaatgcg 5820 
tttactacaa aaaacaccca gagattagat gctgccgcaa agcctgcaaa taccagcatc 5880 
catccgaaaa tcgatacaag catcatgttt tttctgccga atttatctga gatatatccg 5940 
ccgtaaaagc ttgcgaggat gccgactgat gagctcgcgg cgatgaccag ccctgcatag 6000 
gaagctgatg cgccttggac ggctgtcaaa taaatcgcta aaaaaggaat gctcatcgat 6060 
gttgccattc tgccgaaaat ggttccgatt ataattgtac gcgttggatg catagcttga 6120 
gtattctata gtgtcaccta aatagcttgg cgtaatcatg gtcatagctg tttcctgtgt 6180 
gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 6240 
cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 6300 
tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 6360 
gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6420 
ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6480 
caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6540 
aaaaggccgc gttgctggcg tttttcgata ggctccgccc ccctgacgag catcacaaaa 6600 

atcgacgctc aagtcagagg tggcgaaacc cgacaggact acaaagatac caggcgtttc 6660 

cccccggaag ctccctcgtg cgctctcctg :tccgaccct gccgcttacc ggatacctgt 6720 

ccgc:t::o: ccct:cggga agcgtggcgc :ttctca:ag ctcacgctgt aggtatctca 6780 

gttcggngra ggtcgttcgc tccaagccgg gctgtgcgca cgaacccccc gttcagcccg 6840 

accgccgcgc cttacccggt aacta:cgrc ttgagtccaa cccggtaaga cacgac:tat 6900 

cgccactggc agcagccact ggtaacagga ttagcagagc gaggtatgta ggcggtgcta 6960 

cagagttctc gaagtggtgg cctaactacg gctacac^ag aaggacagta tttggtatct 7020 

gcgctccgcc gaagccagct accttcggaa aaagagctgg tagctcttga tccggcaaac 7080 

aaaccaccgc tggtagcggt ggtttrtttg tttgcaagca gcagattacg cgcagaaaaa 7140 

aaggatctca agaagatcct ttgatctctc ctacggggtc tgacgctcag tggaacgaaa 7200 

actcacgtra agggattttg grcatgagat tatcaaaaag gatctccacc tagatccttt 7260 

:aaat:aaaa atgaagtctt aaatcaa:cc aaagcatata tgagtaaact tggtctgaca 7320 

gttaccaa:g cttaatcagt gaggcacctia tct cage gat: ctgtctattt cgttcatcca 7380 

tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7440 

ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta :cagcaataa 7 5 00 

accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 7560 

agtctattaa ttgttgccgg gaagctagag taagtagttc gccagctaat agtttgcgca 7620 

acgttgttgg cattgctaca ggcatcgcgg tgtcacgctc gtcgtttggt atggcttcat 7680 

tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 7740 

cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 7800 

tcatggttat ggcagcactg cataa:tc:c ttactgtcat gccatccgta agatgctttt 7860 

ctgtgactgg tgagtactca accaagtca: tctgagaata ccgcgcccgg cgaccgagtt 7920 

gc:cttgccc ggcgtcaata cgggataata gtgtatgaca tagcagaact ttaaaagtgc 7980 

tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 8040 

ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 8100 

gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 8160 

cacggaaatg ttgaatac:c azaczczzcc tttttcaata ttattgaagc atttatcagg 8220 

gt:attgtct catgagcgga tacatatttg aatg:a:::a gaaaaataaa caaatagggg 8280 

:tccgcgcac azzzc.ee cga aaagtgccac ctgtatgcgg tgcgaaatac cgcacagatg 8340 

cgtaaggaga aa.aza.ecge a teaggegaaa ttgtaaacgt taatattttg ttaaaattcg 8400 

cgtcaaatan ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc 8460 

c:tat,aaatc aaaagaatag accgajatag ggttgagtgt tgttccagtt tggaacaaga 8520 

gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg 8580 

a:ggcccact acgtgaacca tcacccaaat caagtttttt gcggtcgagg tgccgtaaag 8640 

ctctaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga 8700 

acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg 8760 

tagcggtcac gctgcgcgta accaccacac ccgccgcgct taa 8803 

<210> 73 
<211> 8320 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN296 plasmid 

<400> 73 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 

ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 

gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 

tttaacgatg tcagaaacgg caggccacct tgatgtgttg gaggctgaag aagccatcac 360 

gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 

catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc 480 

gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 

ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600 

ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 

gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 

gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 

cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 

aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatn--cgr. gttaggttac 900 

tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 

ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020 

gtcggccgcc ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 

acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140 

ctgtt:c:cag ctcttgtaga aaagctggca ggacgcgcgc t:a:gsaaaa acaatgatag 1200 

aactgcctgt ggtggagtgg cttgtrtctc acggggcagt tcttgatagt ggaagggaga 1260 

gattgtcgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 

cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 

ttaccccgtg cagatgagtg ggcaaccgag acagcggaag cggaaaaaag aatgcagcgg 1440 

tttgcaccct acattgccga ggtgtt'cct gagacgaaag gcgctaaagg aatcatcgag 1500 

tctccgctnt ttgaggtgca gcat:atgaag ggaaagctgg aagcggcata tcagcagcca 1560 

tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620 

gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680 

atgctccagg aaaccgatga ttaccgcatc tcacaggaag agcggtttac cgggtttttc 1740 

tcccgctact cgattgctgt cggctcgaca ggaaatctag gtttaagcat cggcatcatc 1800 

ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860 

aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 

gcggtgaacg aagggagacg gcaggcggaa caagatccat tccgtcattt tattgatgat 1980 

g.-L^CcLZZczc gtcagccgtt cttaggatat gctgctgc^g caagccgatt aaaaacacag 2040 

cttgactgta tgaatacaaa gccaagtctt gagacgccct tgnttgtgra tctgccgtigc 2 100 

ggagccggcg gaggaccggg cggtgtagca tctgggccga agc:tt:a:a cggagatgat 2 160 

gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttatrca 2220 

ggacttcacg agaagacctc cgtccaggat atcggcccgg ataatcagac ggctgctgac 2280 

ggacttgccg tagggaggcc gtcaggactt gccggcaagc tgactgaacc gctcctgagc 23 40 

ggctgttata cggtagagga caatacgctt tatac":tgc tccatatgct ggctgtatct 2400 

gaagataaat attitagagcc ctctgctctt gctggcatgt tcgggccggt tcagc:t:tt 2460 

tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520 

gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 

ggtgctga" tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttt'ccgc 2640 

tacaatc::: gtaagggtt" taacctacag agagccaggt gtaaacagng aaaaa:aaag 2700 

aacttaacc: acatacctta tatacacagc acaatcggga g:c:tc:gca gctcgagcaa 2760 

tagttaccc: :a:tatcaag ataagaaaga aaaggatttt tcgctacgct caaatccttt 2820 

aaaaaaacac aaaagaccac att:tt :aa: gtggtc::ta ttcti^caact: aaagcaccca 2880 

ttagctcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2940 

cagtaatatt gacttttaaa aaaggattga ttctaatgaa gaaagcagac aagtaagcct 3000 

cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060 

tttagacaat tggaagagaa aagagatatt taatcattat ttgaaccaac aaacgacttt 3120 

tagtataacc acagaaattg atattagtgt tttataccga aacataaaac aagaaggata 3180 

taaattttac cctgcattta [missing] gacaagggtg ataaactcaa atacagcttt 3240 

tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300 

[missing] gatggtgtat ctaaaacatt ctctggtatt tggactcctg taaagaatga 3360 

cttcaaagag ttttatgatt tatacctttc tgatgtagag aaatataatg gttcggggaa 3420 

attgtttccc aaaacaccta tacctgaaaa tgctttttct ctttctatta ttccatggac 3480 

ttcatttact gggtttaact taaatatcaa taataatagt aattaccttc [missing] 3540 

tacagcagga aaattcatta ataaaggtaa ttcaatatat ttaccgctat ctttacaggt 3600 

acatcattct gtttgtgatg gttatcatgc aggattgttt atgaactcta ttcaggaatt 3660 

gtcagatagg cctaatgact ggcttttata atatgagata atgccgactg [missing] 3720 

agtcggtttt ctaatgtcac taacctgccc cgttagttga agaaggtttt [missing] 3780 

ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca tcatattgaa 3840 

ctgtatgtct ctgatttgga ggcgtctagg cggttttggg gctggttctt aaaagaactt 3900 

ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga tcgtttttac 3960 

ctagtgattg tgcaggcgaa agagccattt ctagagccgg aacaccacag atgccgagtc 4020 

ggtctgaacc atctcgcatt tcatgctgaa tccaagcttc aagtcgatca gatgactgaa 4080 

aaattgacgg caaaaggcta tcgtgtgttg taccgagaca ggcatccttt tgccggagga 4140 

gacgggcatz atgcagtctz ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200 

ccaagctgtt aatcgtgatc ttcggacagg ctgttcagct ttttctcaat gcgatccagc 4260 

tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4320 

aatataataa atacaaacag ctgaaacatc acatcaccta tattcatgtt cttcacctca 4380 

tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440 

aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4500 

cgagcagggt aatgccgctc aggacagaaa aaatataaat cggccggcca gcgccaaaca 4560 

ggtctataca tatccccccg acccaagggc cgatgacgtt tccgagctgt ggaaaaccga 4620 

ttgccccgaa ataagtgcct tttaatcctg gttttgcaat ctggtctaca tacaaatcca 4680 

tcatagagaa taaaagcact tcgccgattg taaatgtgat aacaatcatc acaattgatg 4740 

gaacaccgtg tgatacggtg aaaatggcca tgctgatgct aaccatcaca ttaccgagca 4800 

t-ragagaaca aagcggcgaa aaccgtttrg caaaatggac aatgggaaat tgcgtcgcca 4860 

acacaacgat tgcgtttaat gtcagcatca gcccatacag cttcgttcca ttgccgatca 4920 

a '? , 3' 3'? - -ctg cgccatatac tgagggaacg tggaactgaa ttgcgagtag ccgaaggcgc 4980 

atagc?:aa: gccgaccaaa gcaatggtaa aaagataatc ctttcgcgtg accataaacg 5040 

cttcccgcac gcccatatt: cgggaccggg ctggtgctga taaggacgga tgtcttttaa 5100 

attggagggc aagcacaat: ccgtatagtc cgtaaa:gac tgcaggcacc aaaaagggcg 5160 

tagtcgattg cgacgagccg aaatataggc caagcacagg tccgaagaca acgccgatat 5220 

taatagccgc atagcgtaaa ttaaaaacra gcagtctcgt tttttcttct gtcatatcag 5280 

acaacaaggc ctttgaagcg ggctcaaaca gtgatttgca aagaccgttt aatgcgttta 5340 

c:acaaaaaa cacccagaga ttagatgctg ccgcaaagcc tgcaaatacc agcatccatc 5400 

cgaaaaccga tacaagcatc atgtttcttc tgccgaattt atctgagata tatccgccg: 5460 

aaaagcctgc gaggatgccg actgacgagc rcgcggcgat gaccagccc: gcataggaag 5520 

c:gatgcgcc ttggacggcc gtcaaacaaa ccgctaaaaa aggaatgctc atcgatgttg 5580 

ccattctgcc gaaaatggtt ccgatta^aa ttgtaacgcg ttggatgca" agcttgagta 5640 

::c:a:agtg tcacctaaar agcrtggcgt aatcatggtc atagctgtt: cctgtgtgaa 5700 

a::gt:atcc gctcacaatt cca.ca.c.3.a.ca. racgagccgg aagcataaag tgtaaagcc: 5760 

ggggtgccta at gag!: gage taactcacat taattgcgtt gcgctcactg cccgctttcc 5820 

agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 5880 

gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgctc 5940 

ggctgcggcg agcggtatca gcccacccaa aggcggtaat acggttatcc acagaatcag 6000 

gggacaacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 6060 

aggccgcgt" gctggcgttt ttcgataggc :ccgcccccc tgacgagcat cacaaaaatc 6120 

gacgctcaag tcagaggtgg cgaaacccga caggaccata aagataccag gcgtttcccc 6180 

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 6240 

cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg tatctcagtc 6300 

cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc 6360 

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc 6420 

cactggcagc agccactggt: aacaggatta gcagagcgag gtatgtaggc ggtgctacag 6480 

agttcttgaa gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg 6540 
ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa 6600 

ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag 6660 

ga:ctcaaga agatcctttg atcttttcta cggggtcrga cgctcag:gg aacgaaaact 6720 

cacg::aagg gattttggtc atgagartat caaaaaggat cc:cacc:ag atccttttaa 6780 

a::aaaaa:g aagttttaaa tcaacctaaa gtatatatga gtaaact:gg tcigacagtt 6840 

accaa:gc:: aatcag:gag gcacccatct cagcgatctg tc:at:tcgt tca:ccatag 6900 

::gcc:gac: ccccg:cg:g taga:aac:a cgatacggga gggctcacca tctggcccca 6960 

g:gc:gcaa: gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc 7020 

agccagccgg aagggocgag cgcagaagcg gtcctgcaac ttcatccgcc tccatccagt 7080 

c:a::aat:g ttgccgggaa gctagagtaa gtagttcgcc ag:taatagc ttgcgcaacg 7140 

c:g:cggcat tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gc::cattca 7200 

gctccggtcc ccaacgatca aggcgagtta catgatcccc catgtcgtgc aaaaaagcgg 7260 

::agc:cc:t cggtccccog atcgttgtca gaagtaagt: ggccgcagcg ttatcactca 7320 

cggtcatggc agcactgcat aattctctta ctgtcatgcc atccg:aaga tgc:cttctg 7380 

:gac:ggtga gtactcaacc aagtcartct gagaataccg cgcccggcga ccgagttgct 7440 

c^tgcccggc gtcaatacgg gataatagtg targacatag cagaactcta aaag:gctca 7500 

:ca:tggaaa acgttcttcg gggcgaaaac :c:caaggat cttaccgctg ttgagatcca 7560 

gttcgatgta acccactcgt gcacocaact gancttcagc atcttttact ttoaccagcg 7620 

tctctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac 7680 

ggaaatgttg aatactcaca ctcttccttt ttcaatatta ttgaagcant tatcagggtt 7740 

a:tgcctcat gagcggatac atatttgaat gtacttagaa aaataaacaa ataggggttc 7800 

cgcgcacatt tccccgaaaa gtgccacctg tatgcggtgt gaaataccgc acagatgcgt 7860 

aaggagaaaa taccgcatca ggcgaaattg taaacgttaa ta:tttgtta aaatccgcgt 7920 

taaa^acttg ttaaancagc tcatntttta accaataggc cgaaatcggc aaaanccctt 7980 

ataaa:caaa agaatagacc gagacagggt tgagtgttgn tccagtttgg aacaagagtc 8040 

cacrattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 8100 

gcccactacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggrgc cgtaaagctc 8160 

taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaarg 8220 

t: g xr gagaaa ggaagggaag aaagcgaaag gagrgggcgc tagggcgctg gcaagtgcag 8280 

cggrcacg:: gcgcgnaacc a.ccaca.cccg ccg:gct:taa 8320 

<210> 74 

<211> 250 

<212> PRT 

<213> Clostridium acetobutylicum 

<400> 74 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 
1 5 10 15 

Lys Val Ile Leu Val Leu Asp Val Gly Asn Thr Asn Ile Val Leu Gly 
20 25 30 

Ile Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 
35 40 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly Ile Gln Val Met Asn Leu Phe 
50 55 60 

Gln Gln Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val Ile Ile Ser 
65 70 75 80 

Ser Val Val Pro Asn Ile Met Tyr Ser Leu Glu His Met Ile Arg Lys 
85 90 95 

Tyr Phe Lys Ile Asn Pro Leu Val Val Gly Pro Gly Ile Lys Thr Gly 
100 105 110 

Ile Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile 
115 120 125 

Val Asn Ala Val Ala Ala His Glu Ile Tyr Lys Arg Ser Leu Ile Ile 
130 135 140 

Ile Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala Val Arg Glu Asn Gly 
145 150 155 160 

Asp Tyr Leu Gly Gly Ala Ile Cys Pro Gly Ile Lys Val Ser Ser Glu 
165 170 175 

Ala Leu Phe Glu Lys Ala Ala Lys Leu Pro Arg Val Glu Leu Ile Lys 
180 185 190 

Pro Ala Tyr Ala Ile Cys Lys Asn Thr Ile Ser Ser Ile Gln Ser Gly 
195 200 205 

Ile Val Tyr Arg Tyr Leu Arg Gln Val Lys Tyr Leu Phe Glu Lys Leu 
210 215 220 

Lys Glu Asn Leu Pro Asp Gly Arg Arg Thr Arg Thr Ser Leu Val Leu 
225 230 235 240 

Ala Thr Gly Gly Leu Ala Lys Leu Ile Asn 
245 250 



<210> 75 

<211> 258 

<212> PRT 

<213> Rhodobacter capsulatus 

<400> 75 

Met Leu Leu Cys Ile Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val 
1 5 10 15 

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg Ile Ala Thr Asp His 
20 25 30 

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Gln 
35 40 45 

Leu Lys Gly Leu Gln Gly Arg Ile Ser Glu Ala Ile Ile Ser Ser Thr 
50 55 60 

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 80 

Asp Cys Arg Pro Tyr Val Val Gly Lys Pro Gly Cys Glu Leu Pro Val 
85 90 95 

Ala Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val 
100 105 110 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu Ile Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala 
130 135 140 

Tyr Ile Gly Gly Val Ile Ala Pro Gly Val Asn Leu Ser Leu Glu Ala 
145 150 155 160 

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 
165 170 175 

Gln Gly Val Ile Gly Thr Asn Thr Val Ala Cys Ile Gln Ser Gly Val 
180 185 190 

Tyr Trp Gly Tyr Ile Gly Leu Val Glu Gly Ile Val Arg Gln Ile Arg 
195 200 205 

Met Glu Arg Asp Arg Pro Met Lys Val Ile Ala Thr Gly Gly Leu Ala 
210 215 220 

Ser Leu Phe Asp Leu Gly Phe Asp Leu Phe Asp Lys Val Glu Asp Asp 
225 230 235 240 

Leu Thr Met His Gly Leu Arg Leu Ile Phe Asp Tyr Asn Lys Gly Leu 
245 250 255 

Gly Ala 



<210> 76 

<211> 10301 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN240 plasmid 
<400> 76 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacca acggggcagg 60 

Ltagtgacat tagaaaaccg actgcaaaaa gtacagtcgg cattatctca tattataaaa 120 

gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 180 

aaccatcaca aacagaatga tgcacctgta aagatagcgg :aaata:a:t gaattacctt 240 

cattaacgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300 

:taagt:aaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 360 

gtataggtgt trtgggaaac aatttccccg aaccantata tittctctaca ncagaaaggt 420 

ataaatcata aaaccctttg aagtcattct ttacaggagt ccaaatacca gagaatgttt 480 

tagacacacc atcaaaaac: gtataaagrg gctctaac:t atcccaataa cctaactctc 540 

cgtcgccat: graaccagc: c:aaaagc:g tatttgagrt taccaccctt gtcactaaga 600 

aaataaatgc agggtaaaat ttzatatcctt cttgttttat gtttcggtat aaaacactaa 660 

tacc.aanr tc tgtggttata ctaaaagtcg tttgtnggtt caaataatga ctaaatatct 720 

ctttcccct: ccaactgtct aaatcaatnt tattaaag:t car:tgatat gcctcctaaa 780 

ttctcatcta aagtgaatct aggaggctta cttgtctgct ttcttcatta gaatcaatcc 840 

t: t z z z caaaa gtcaatat^a c:gtaaca:a aatatatatt tcaaaaatat cccactt:at 900 

ccaa:r:"cg tttgttgaac taatgggtgc tttagctgaa gaataaagac cacattaaaa 960 

aacg:ggtct tttgtgtttt :r:aaaggat ttgagcgtag cgaaaaatcc ttctcttcct 1020 

tatcctgata ataagggtaa ccattgaact cggtaccaag agtttgtaga aacgcaaaaa 1080 

ggccatccgt caggatggcc tcctgcttaa tttgacgcct ggcagtctat ggcgggcgtc 1140 

ctgcccgcca ccctccgggc cgttgcttcg caacgctcaa atccgctccc ggcggatttg 1200 

tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg 1260 

ac:gagcctt tcgttttatr tgatgcctgg cagt:ccc:a ctcrcgcatg gggagacccc 1320 

acaczaccaz cggcgctacg gcgtttcact tctgagttcg gcacggggtc aggtgggacc 1380 

accgcgctac tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta 1440 

atctg:atca ggctgaaaa: cttctctcat ccgccaaaac aggatcctac ggaaatggag 1500 

cggcaaaacc gttttactct caaaatctca aaagaaaacc cccgataaag ggggcttttc 1560 

ttctacaaaa ttg:acgggc tggttcgttc cccagca:tt gt:caart:: gttttgatca 1620 

ttragaacag ccactt:cgg ctcat jgett gccg-::tctt qa.zcaga.caz cattttgtag 1680 

gaaa:aa:aa tgaccttatc tcctt retge acaaggcgtg cggctgcacc gtttaagcat 1740 

atgacgccgc ttccccgttt accaggaata atacacgttt caagacgtgc tccattatta 1800 
ttattcacaa tttgtacttt ttcattagga agcattccca cagcatcaat gagatcttca 1860 

tcaa::g:aa tgcttcccac atagttcagg tttgcctccg caacag:cgc cctgtgaagt 1920 

ttgccgczca tcattgttcg atacatatta tatcccctcc att:c:cgaa :atcaataat 1980 

gatat:a:ct atcaaacgcg cttttgaaaa agcaactgca acagcgagaa :ca:c:ttcc 2040 

agcaatctca ttcacaggct cgagttccgg a:aggaa:aa agctcracat agtccatggt 2100 

tccgctag:c gtrtcaatga tatcttttgc agcc:tta:c accgc::cag gatctctttc 2160 

accggcttgg acaagctccg cac:tgt:tg aagggcccga tacagcttag gcgcttcttt 2220 

tcttccc:ca gcrgttaagt atacattgcg agagcttttg gctaagccgt c:tcc:ctct 2280 

gaccg:a:cg acaggaacca attcaatatc catgaagaag rcgcngatta acccatcaac 2340 

aacagc:acc tgctgcgcat cttttaaacc gaaa:aggca cgagtcggct tgacragatt 2400 

gaaaagc.cc gtcagtacga ccgcgacccc gtcaaaacgc cctcctcttg agcgcccgca 2460 

taacacgrct gtgcgncttt ctacatgaa: cgtgacattc rtttcaccgg gatacatatc 2520 

argagcatct ggcgtaaaaa gaatatcgac cccggcgtt: tctgcaagag ctgcatcccg 2580 

ctcaatatcg cgcggatatg cttcaaaatc ttcactaggg cogaattgtg caggattcac 2640 

aaaaa:accc ataataacgg cgtcgttttc tcg:cttgc: ttgtctgcta aggttaaatg 2700 

cccctcacgc agaaacccca ccgtcggaac aaatccgat: gacccgccct ctgaatggta 2760 

ttgtttcacg gcttctttca gctgtgaaat atcagtaatc tgtctcatct 2820 

cgtacaagcc gtcaagcact gtccggttca tttgaaagga a:gcctttgt tcagggaaag 2880 

cacgatgtc: tacatcctga acatatccgc tgattgctg: tncgatggtt tcatcaatgc 2940 

gcgtatattg ct'tacaaat ttaggtgttc tctcaacacc gtggccgata atatcatgac 3000 

aaacga^aac ttgtccgtcc gcttneacae cagccccgaf. tccaatgacc ggtatgctta 3060 

gcgtctcggc aattttggct gtgagttctg ccggcacaca ttccagcaca agcatcatag 3120 

ctcctgc::c ttcgcatttt atactgtctt ctattaatt: tttggcgctt tgttcgtct^ 3180 

cgccctgtac ttcatagccg cccagtacgc cgaccgactg cggtgtcaaa cctaagtgac 3240 

tgactactgg aatgcctcca agcgtcaatg cgcgaatgga ttcaaacacg ccttctccgc 3300 

cctcaagctt cagtgcgtca gctccgcttt cctgaacga" agccgctgca tttttcagcg 3360 

ta:cttcctt agacaggtga taagacataa acggcatatc tgtcacaata aaggtattcg 3420 

gcgcarccct tttaacggct tttg:aLgal ggatcatgt: cgcaactgt^ acaccgacaq 3480 

:tgaa:caa3 gccgaggacg accattccaa gtgaatcacc ga::aaaatc atgt-raactc 3540 

::jct:g:tc agraagtrta gctgccggat aat rataagc ggtcagcatg acaatcggt: 3600 

:ttca^ac:c c:tca:t titt aqaaaatcca atttt.att:r. catgttttct; :ctc:ttcta 3660 

gagcgtcctg ctgttgttaa gattattata ccacaccttg tagataaagt caacaacttt 3720 

ttgcaaaann tttcaggaat tttagcagag gttgttctgg atgtagaaca aaacatc::t 3780 

ccgctc::gc gctgttagga tatctttctt ggaagctagg taggcctcga gttatggcag 3840 

ttggttaaaa ggaaacaaaa agaccgtttt cacacaaaac ggtctttttc gatttctttt 3900 

tacagtcaca gccacttttg caaaaaccgg acagcntcat gccttataac tgctgtttcg 3960 

gt:gacga:g aictgccgtt ttcttctgca agccaaaaaa cct:ccgtta caacgagaag 4020 

gartcttcac t:nctaaagt tcggcgagtt tca:ccctct grcccagtcc ntttttggat 4080 

caaggcagac tgctgcaatg tctatctatt ttaa:aatag gtgcagttcg caggcgatac 4140 

tgcccaatigg aagtatacca aaatcaacgg gctcg:acca acacattagc ccaattcgat 4200 

atcggcagaa tagatttttt taatgccttc gttog:ttct aaaagcagaa cgccttcatc 4260 

atctatacct aacgccttac cgtaaaaggt tccgnntaac gntctggctc ccacattagt 4320 

gccaataccg agcgcaragc tttcccataa aagcc:aatc ggcgtaaatc cgcgcgtcat 4380 

ataaccccgg taccgtttct caaagcatag taaaatatgc tggatgacgc cggcccgatc 4440 

aat::tt:cc ccagcagctt ggctgaggct tgtcgcgatg tccntcaatt ca:c:ggaaa 4500 

atca:taggc tgctggctaa cgttaatgcc gatcooaatg atcactgaac gtacgcggtc 4560 

ttcttcagcc tgcatttccg ttaggatacc gactgctttt tttccgttaa rcaaaatatc 4620 

atctggccat: ttaatacccg tttggatgcc tgctgcctct tcta:tccct gcacaacagc 4680 

tactgcagca agcagagtca gctgcggtgt: tttttggagc ggaatgtcag gccgcaaaat 4740 

caggctcatc caaacaccgt ttccttctcg agaatgccat accc:agaca ttcggcccct 4800 

tccggcrgrt tgtttgncag cca.cc3.c3.Ag ggtgccttcc ggtgcgttat tattcgcgag 4860 

ctcacgagcc gtcttttgcg tgcttgaaag aacgccatgg taaacaagac gccggcccat 4920 

cacttccgtt ttcaatccaa aacgaatttc gc t: z ncac tg agtttitccgg gttt" ttgat 4980 

gagccgatat ccttttcttc taacggcttc tacttcataa ccctcttccc gaagctcttc 5040 

aa:atgct:c cacacagcag ttcttgaaca gccgagagca tcactgartt tttggccgga 5100 

aa:aaa:tca ttgccggcct gagaaaataa ttcaataagg tct::tctta atgt^gaccg 5160 

ca:g:c:tca gccactcctc tatgtgtttc ttttgattgg agagcttccc zgzcacaaca 5220 

gcctgctcga tccactgtaa ttcttctgac acccantttc cggccggccg gt" tcgaagc 5280 

gcaagcaagt ccttacccg: gatatcaaga :c>:::aaggc ttttgatcgg caggttttga 5340 

:aagcg:act gaatgtcct: cagtttcttt :ca:ccagt: tttcgttttg ccgaagcugc 5400 

gatattttgg ccgctgagag cagtgc:::: ::cccagctc tg:aca:tgt cazzgcgzca 5460 
aggctctggc caaacgtatc ggcaangtga atggcttccr tgatcacttt tcccgggagc 5520 

ttccaggctt tcaggaaaag gggcgcgtct t^caaaacta tgccaaggtt aattaaaaga 5580 

gcagcccaaa gctcctcacg ggatgttaaa gagaagaatg gaaa:tcact cgttgaaatc 5640 

aggttccctc gtttatgata aaaaccagga agctcttcat acaatc:cgt ttgaatgagt 5700 

gtttgaagcg cctggcgaga agctcttccc tgcagcaatt tctcaaactc tatagttttt 5760 

cgttcgactg aaacatggga gaggagtgat ttttctttcg caatggcttc ttctgtttcc 5820 

ggtgaaagcg taaagccaag ctggctcata aagcgtacgg ctctcagcat acgaagcgca 5-3 

tcctc::gaa atctatcctc aggctttcca acggttcgaa tcactttctg atcaatatct 59-10 

ttcttgccgc caaaataatc aagcaccttc ccgtccgctg tcatggccat cgcattgatc 6 00- 

gttaaatctc tgcgttttag atcctcttct aatgatgaga taaatrgcac ttctgacggt 6 0*-' 

cttctgaaat caacataatc agattcagtc cggaatgtcg tgactccata ggtttcatcc 6I_' 

tcccagagca caataatggt cccgtgctct ttgcctacat caacagtccg ctgaaacagc 6 1-" 

cgttctactt gatcaggtgc cgcatctgtc gcgatatcga catctccgat cgttcgtttc 614. 

atatagctgt cacgaactgc gcccccgaca aaataagcct gatggcccgc ttcgattaag 63 v; 

atgcggagca cgggaagtgc tttgataaaa actttttcca tgtgatcact ccggttctgc 6. ':•'.> •■ 

taaatcggca taaatctgtt cacactggct gacaattttt ttagaagaaa attcattttc 64.: 

aagcaicrct attgccgcct ttgtaaaacg attgcttagc tgttcatctt ctaaaatgct 6 4-.. 

catcgcgcgg gctgttgcgg ccgtaacatc accgacatcc accaaaaatc cgctcacatt 654 1 

gttrtttata acctcaggga taccgccaat gtttgttcca atacaaggca ctccgcaagc 66 00 

catcgcttca agcaggacaa ggccaaagct ttcttttnca gatagcagca gcttcaaatc 6660 

gctaatagaa taaagatctt caacacggtc ttgatttcca agcattaaga cttggtcttc 6710 

caagccatat tttctgataa gctcgcaggc tgtcgatttc tccggaccgt ctccgactaa 67 ~;0 

aagcagcttc gctttcgttt tgccagcgat attgcggaac acacggatga carcctgcac 6840 

gcgtgcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtr: 69 00 

cttgaagtgg tggcctaact acggctacac tagaaggaca gnarttggia cctgcgctct 696 0 

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 7010 

cgctggtagc ggtggrtttt: :tg:ttgcaa gcagcagatt acgcgcagaa aaaaaggarc 70 : .C 

tcaagaagat: cctttgatct "ttc:acggg gtc tgacgc L. cag:ggaacg aaaactcacg 7 140 

ttaagggatt: t tgg^catga gattatcaaa aaggatcttc acc:aga:cc ttt:aaa::a 71''0 

aaaatgaag: zzZcLcL&zcaci tctaaagta: a:a:gag:aa actcggtctg acagt:acca 7160 

atgcttaatc a g t g a gg c a c cta:ctcagc gatictgtcta tLtLtcgttcat: ccatagt: g c 7 311 
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ctga:t:ccc gtcgtgtaga taactacgat acgggagggc ttaccarctg gccecagtgc 73 8 

tgcaacgata ccgcgagacc cacgctcacc ggctccaga: tta:cagcaa caaaccagcc 7 4 4 

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccag:cta: lb': 

taat:g:tgc cgggaagcta gagtaagtag ttcgccagtt aacagtttgc gcaacgctgc 7 00 

tgccat:gct acaggcatcg tggtgtcacg ctcgtcgttt ggra:ggctt cattcagc:c 70 2 

cggt:cccaa cgatcaaggc gagtcacatg atcccccatg trgtgcaaaa aagcggc:ag 16 1 

ctcc-tcggt ec tec gat eg ttgteagaag taagttggce gcagtgttat eaeteatggt 774 

tatggcagca etgeataatt ctcttaetgt cacgccatcc gtaagatget tttctgtgac 7 8 0 

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctettg 7 8 0 

cccggcgtca ataegggata ataccgcgcc aeatagcaga actttaaaag tgc tea teat 7 92 

tggaaaaegt tettegggge gaaaactetc aaggatctta cegetgttga gatceagtte 7 9 8 

gatgtaaeee aetegtgeac ceaaetgatc ttcageatct tttaetttea ecagegttte 8 04 

tgggtgagea aaaacaggaa ggeaaaatgc cgeaaaaaag ggaataaggg egaeaeggaa 31 j 

atgttgaata ctcatactet tcctttttea atattattga ageatttate agggttattg 310 

teteatgage ggatacatat ttgaatgtat ttagaaaaat aaaeaaatag gggtteegeg 32 2 

caeattteee egaaaagtgc caectgaegt ctaagaaace attattatea tgaeattaac 32 8 

ctataaaaat aggegnatea cgaggceett tegtctcgea tgeggateag tgagggtttg 3 3 4 

caaetgeggg teaaggatet ggatttegat caeggeacga teategtgeg ggagggeaag 3 4 : j 

ggctccaagg ategggcett gatgttaecc gagagettgg caeeeageet gcgegageag 34 0 

gggaattgat ceggtggatg accttttgaa tgaectttaa tagattatat taetaattaa 30.:.' 

ttggggaeec tagaggtcec cttttttatt ttaaaaattt ttteacaaaa eggtttaeaa 30 8 

geataaeggg ttttgetgee cgeaaaeggg etgttetggt gttgetagtt tgttateaga 304 

ategcagatc eggcttcagg tttgeegget gaaagegeta tttettceag aattgceatg 3'7 j 

attttttcee caegggagge gtcactgget eecgtgttgt eggeagcttt gattcgataa 37 0 

geageatcge ctgttteagg ctgtetatgt gtgactgttg agetgtaaca agttgtetea 381 

ggtgttcaa: ttcatgttet agttgctttg ttttaetggt tteaeetgtt etattaggtg 888 

ttaeatgctg tteatctgtt aeattgtcga tetgttcatg gtgaacaget t t a a a tgeae 894 

caaaaactcg taaaagetet gatgtatcta tcttttttac aregttttea tctgtgeata 90 j 

tggacagttt tecctttgat atetaaeggt gaacagttgt tetaettttg tttgttagtc 900 

ttgatgette aetgatagat acaagagcea taagaacete agatecttce gtatttagee 912 






agtatgttct ctagtgtggt tcgttgtttt tgcgtgagcc atgagaacga accattgaga 9180

tcatCJCttlac cttgcatgtc [illegible] 11tgc:tcaa aactggtgag ctgaa11111 9240

goagttaaag catcgtgtag [illegible] agtccgttac gtaggtagga atctgatgta 9300

atggttgttg gtattttgtc accattcatt 11tatctggt tgttctcaag ttcggttacg 9360

agatccattt gtctatctag ttcaacttgg aaaatcaacg tatcagtegg geggeetcgc 9420

ttatcaacca ccaatttcat attgctgtaa gtgtttaaat ctttacttat tggtttcaaa 9480

accca11ggt taagcc1111 aaactcatgg tagttatttt caagcattaa catgaactta 9540

aa'tcatcaa ggctaatctc tatatttgcc ttgtgagttt tcttttgtgt tagttct111 9600

aa:aaccact cataaatcct catagagtat ttgt111caa aagacttaac atg11ccaga 9660

tcatatttta tgaa11tttt taactggaaa agataaggca atatctcttc accaaaaact 9720

aa:tc:aatt tttcgcttga gaacttggca tagtttgtcc actggaaaat ctcaaagcct 9780

:caaccaaag gattcctgat ttccacagtt ctcgtcatca gctctctggt tgc11tagct 9840

aatacaccat aagcattttc cctactgatg ttcatcatct gagegtattg gttataagtg 9900

aacgacaccg tccgttc11t ccttgtaggg ttttcaatcg tggggttgag tagtgecaca 9960

cagcacaaaa ttagcttggt ttcatgctec gttaagtcat agegaetaat cgctagttea 10020

"ttgctttga aaacaactaa ttcagacata catctcaatt ggtctaggtg attttaatca 10080

ctataocaat tgagatgggc tagtcaatga taattactag tccttttcct ttgagttgtg 10140

ggtatctgta aa11ctgcta gacc11tgct ggaaaacttg taaattctgc tagaccctct 10200

gtaaattccg ctagaccttt gtgtgttttt tttgtttata ttcaagtggt tataatttat 10260

agaataaaga aagaataaaa aaagataaaa agaatagatc ccagccctgt gtataactea 10320

ctactttagt cagttccgca gtattacaaa aggatgtege aaaegetgtt tgctectcta 10380

caaaacagac cttaaaaccc taaaggctta agtagcaccc tcgcaagctc gggcaaatcg 10440

ctgaatatcc cttttgtctc cgaccatcag gcacctgag: cgctgtcttt ttegtgacat 10500

tcagtccgct gcgctcacgg ctctggcagt gaatgggggt aaatggcact acaggcgcct 10560

:t:a:gga:t catgcaagga aactacccat aa<_acaagaa aagecegtca egggeetctc 10620

agggcgtttt atggcgggtc tgctatgtgg tgctatctga etttttgetg 11cagcagtt 10680

cctgccctct gat111ccag tctgaccact teggattate ccgtgacagg tca11cagac 10740

:ggc:aatgc acccagtaag geageggtat caLcdacagg cttacccgtc 11actatcaa 10800
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Recombinant
pAN236 plasmid

<400> 77

ctcgaggcct acctagcttc caagaaagat atcctaacag cacaagagcg gaaagatgtt 60 

ttgttctaca tccagaacaa cctctgctaa aattcctgaa aaattttgca aaaagttgtt 12 

gactttatct acaaggtgtg gtataataat cttaacaaca gcaggacgct ctagaggagg l--i 

agacatcatg aaaattggaa ttatcggcgg aggctccgtt ggtcttttat gcgcctatta 24 

tttgtcactt tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat 30': 

tcagcctgaa ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc 36: 

ggacacgagt atcaattcgg actttgacct gcttgtcgtg acagtgaagc agcatcagct 42" 

tcaatctgtt ttttcgtcgc ctgaacgaat cgggaagacg aatatattat ttttgcaaaa 4 3 0 

cggcatgggg catatccacg acctaaaaga ctggcacgtt ggccattcca tttatgttgg 54;: 

aatcgttgag cacggagctg taagaaaatc ggatacagct gttgatcata caggcctagg 6 00 

tgcgataaaa tggagcgcgt tcgacgatgc tgaaccagac cggctgaaca tcttgtttca 660 

gca:aaccat tcggattttc cgatttatta tgagacggat tggtaccgtc tgctgacggg 72 C 

caagctgatt gtaaatgcgt gtattaatcc tttaactgcg ttattgcaag tgaaaaatgg 7-;0 

agaactgctg acaacgccag cttatctggc ttttatgaag ctggtatttc aggaggcatg 840 

ccgcatttta aaacttgaaa atgaagaaaa ggcttgggag cgggttcagg ccgtttgtgg 900 

gcaaacgaaa gagaatcgtt catcaatgct ggttgacgtc attggaggcc ggcagacgga 9 0 0 

agctgacgcc attatcggat acttattgaa ggaagcaagt cttcaaggtc ttgatgccgt 102 

ccacctagag tttttatatg gcagcatcaa agcattggag cgaaatacaa acaaagtctt 10? 

ttgagctttt tcggtaacat gctatactca tttcggatca ctaactattt attggaggat 114 

cctgttttgg cggatgagag aagattttca gcctgataca gatcaaatca gaacgcagaa 12 'j 

gcggtctgat aaaacagaat ttgcctggcg gcagtagcgc ggtggtccca cctgacccca 12 0 

tgccgaactc agaagtgaaa cgccgtagcg ccgatggtag tgtggggtct ccccatgcga 10 2 

gagtagggaa ctgccaggca tcaaataaaa cgaaaggctc agtcgaaaga ctgggccttt 1 .;' 

cgttttatct gttgtttgtc ggtgaacyct ctcctgagta ggacaaat :c geegggageg 14 4 

gatttgaacg ttgegaagea acggcccgga gggtggcggg caggacgccc gecataaact 100 



gccaggcatc aaattaagca gaaggecate ctgacggatg gectttttge gtttctacaa 
actcttggta cccagaaaaa gcggcaaaag cggctgttaa aaaagcgaaa tcgaagaagc 



1 






tgtctgccgc taagacggaa tatcaaaagc gttetgctgt tgtgtcatct ttaaaagtca 168 D 

cagecgatga atcccagcaa gatgtcctaa aaracttgaa cac:cagaaa gataaaggaa 1 7 4 J 

atgcagacca aattcattct tattatgtgg tgaacgggat cgctgtccat gcctcaaaag 13 J J 

aggrtatgga aaaagtggtg cagtttcccg aagrggaaaa ggtgcttcc: aatgagaaac 136 J 

ggcagrcttt taagtcatcc tccccattta ata:gaaaaa agcacagaaa gctattaaag 192 0 

caactgacgg tgtggaatgg aatg:agacc aaatcgatgc cccaaaagc: tgggcacttg 19-1 -j 

gatat gatgg aactggcacg gttgttgcgt ccattgatac cggggtggaa tggaatcatc 2 040 

cggca:taaa agagaaatat cgcggatata atccggaaaa tcctaatgag cctgaaaatg 2100 

aaatgaactg gtatgatgcc gtagcaggcg aggcaagccc ttatgatgat ttggctcatg 111 rO 

gaacccacg: gacaggcacg atggtgggct ctgaacctga tggaacaaat caaatcggtg 2 2 2 0 

:agcacctgg cgcaaaatgg attgctgtta aagogttctc tgaagatggc ggcactgatg 2 2 SO 

ctgacatttt ggaagctggt gaatgggttt tagcaccaaa ggacgcggaa ggaaatcccc 2 0 4 0 

acccggaaa: ggctcctgat gttgtcaata actcatgggg agggggctct ggacttgatg 2400 

aatggtacag agacatggtc aatgcctggc gttcggccga tattttccct gagttttcag 2 4 0 0 

cggggaacac ggatctcttt attcccggcg ggcctggt tc tatcgcaaat ccggcaaac" 2520 

atccagaatc gtttgcaact ggagcgactg agaattccaa ttccccatgg agagaaaaga 2 5-10 

aaatcgc:aa tgttgattac tttgaacttc tgcatattct tgaatttaaa aaggctgaaa 2640 

gagtaaaaga ttgtgctgaa atattagagt ataaacaaaa tcgtgaaaca ggcgaaagaa 2700 

agttgtatcg agtgtggttt tgtaaatcca ggctttgtcc aatgtgcaac tggaggagag 2760 

caatgaaaca tggcattcag tcacaaaagg ttgttgctga agttattaaa caaaagccaa 2 r< 2 0 

cagttcgttg gttgtttctc acattaacag ttaaaaatgt ttatgatggc gaagaattaa 2:330 

ataagagttt gtcagatatg gctcaaggat ttcgccgaat gatgcaatat aaaaaaatta 294? 

ataaaaaccc tgttggtttt atgcgtgcaa cggaagtgac aataaataat aaagataatt 0000 

cttataatca gcacatgcat gtattggtat gtgtgga a cc aacttatttt aagaatacag 0060 

aa&Acz&cqz gaatcaaaaa caatggattc aattttggaa aaaggcaatg aaattagact 312 0 

atga.icc.B.&a. tgtaaaagtt caaatgatt-r gaccgaaaaa taaatataaa tcggaratac 0130 

aa:cggcaa: tgacgaaact gcaaaatatc ctgtaaagga tacggatrtt atgaccgatg 3 2 40 

atgaagaaaa gaatttgaaa cgtttgtctg atttggagga aggtttacac cgtaaaaggt 2 2 0~. 

zaazczccza tggtggtttg ttaaaagaaa tacataaaaa attaaacctt gatgacacag 0360 

aagaaggcga tttgattcat acagatgatg acgaaaaagc cgar.gaagat ggattttcta 342 2 
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ttatcgcaat gtggaattgg gaacggaaaa attattttat taaagagtag ttcaacaaac 34S0 

gggccatatt gttgtaraag tgatgaaata ctigaatntaa aacttagttt atatgtggta 354 j 

aaatgtttta atcaagttta ggaggaatta attatgaagt gtaatgaatg taaeagggtt 3 6 j j 

caat:aaaag agggaagcgt atcattaacc ctataaacta cgtctgccct catcaccgga 3 -5 •:■ I 1 

gggtgaaatg tgaatacatc ctattcacaa tcgaatttac gacacaacca aat:t:aatt 3 7 2 j 

tggccttgca ttttatcttt tcttagcgta t:aaatgaaa tggctttgaa cgtctcatta 37- j 

cctgatattg caaatgattt taataaacca cctgcgagta caaaccgggt gaacacagcc 3 3 4 J 

tttacgttaa ccttttccat tggaacagct gcatatggaa agc:atctga tcaattaggc 390 j 

atcaaaaggc cactcctatt tggaattata acaaattgtt tcgggceggt aatcgggttt 3960 

gttggccatc ctttctcttc cttacttatt acggctcgtt ttattcaagg ggcrggtgca 402 j 

gctgcatttc cagcactcgt aatggttgta gttgcgcgcc atactccaaa ggaaaatagg 4 0--0 

ggtaaagcac t:ggtc:tat tggatcgata gtagccatgg gagaaggagt cggrccagcg 4L40 

attggtggaa tgatagecca ttatattcat tggtcctatc ttctactcat tcc:acga:a 4000 

acaattatca etgtteegtt tcttatgaaa ttattaaaga aagaagtaag gataaaaggt 40 0 0 

cattttgata ccaaaggaat tatactaatg tctgtaggca ttgcaccttt tatgttgtrt 4310 

acaacatca: a:agcacttc tcttcttatc gttagcgcge tgtcacrcct gatatccgca 43 1 0 

aaacatatca ggaaagtaac agatcctt:t gctgatcccg gattagggaa aaacacacet 44 4 

tctacgattg gagctecttg tgggggaact aratttggaa cagcagcagg gttigtctct 450 

atggttcctt acatgacgaa agatgttcac cagctaagta ctgccgaaat cggaagcgta 4560 

attactttcc ccggaaeaat gagtgtcatt attttcggct acactggtgg gatacc:gtt 4000 

gatagaagag gtcettcata cgtgttaaac accggag::a catctctttc tgttagcttt 4 6-0 

ttaactgctt cctttctttt agaaacaaca tcatggttca tgacaattat aatcgcattt 474 0 

gtttcaggtg ggc:ttcgtt caccaaaaca gttatatcaa caattgtttc aag cage teg 4-:/0 

aaacagcagg aagetggtgc tggaatgagt ttgcttaaec ttaecagect tttateagag 4-; 6 0 

ggaaeaggta ccgeaactgt aggtggttca ttatccatac ectcaettga tcaaaggtcg 4900 

tcacetatgg aagctgatca gtcaacttat ctgtatagca atttgttacc actctcttea 4 9--0 

ggaaceattg ceactagttg getggttaec ctgaatgeae ataaacattc ccaaagggat 5 "'40 

ccetaaaceg ccaagggatc aaetttggga gagagttcaa aat cga tee c ^"^"araa 51"' 0 

cagtcegaag cggccgcaat tcctgaagac gaaagggeec egtgacacgc ecactcctat 516 0 

aggccaacgc eacgacaata atggccccct agaegteagg tggeaettcc eggggaaacg 522 0 

t g e g e g g a a e eeecacccgt CCatccctec aaazacaz*: c a a a c a c g c a c e c g e c e a c g a 5 2^2 
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gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 5 j. 4 ? 

atctccgtgt cgcccttat: cecttttttg cggcattttg crttectgtt tttgctcacc 5 4 ':■ ]> 

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggt gcacga gtgggttaea 546 

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc c - ^ " 



caatgatgag cacttttaaa gttctgctat gtggcgcggt attaceecgt attgacgccg 



5 5 



ggcaagagca actcggtcgc cgcatacact attctcagaa tgacctggtt gagtactcac 56 4-! 

cagtcacaga aaagcatct: acggatggca tgacagtaag agaantiatgc agtgctgcca 5 7 ' j 1: 

taaccatgag tgataacac: gcggccaact tacctctgac aacgarcgga ggaccgaagg 576-! 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgcc^ngat cgttgggaae 5 

egg age :gaa tgaagecata ccaaacgacg agegtgacac cacgatgcct geagcaaegg ^ 

caacaaegtt gegcaaacta ctaactggcg aactiacttac tctagcttcc eggcaacaat 5040 

taatagactg gatggaggcg gataaagttg caggaccact totgegcteg gcccttccgg 60 ; "O 

ctggctggtt tattgetgat aaatctggag ccggtgagcg tgggtctege ggtatcattg 6 0 6 J 

cagcaccggg gccagatggt aagccctccc gtatcgtagt tatctacacg aeggggagtc 6 12 0 

aggcaactat ggatgaaega aatagacaga tegctgagat aggtgectea ctgartaagc 61 : '0 

attgg:aact gtcagaccaa gtttactcat atatactcta gattgattta aaac:tca:: 6 2 4 j 

tttaatttaa aaggatctag grgaagatcc tttntgataa tctcatgacc aaaacccct: I'i-OO 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttct: 6 3 60 
gagatcc ttt 

eggtggtttg tttgeeggat caagagctac caactctttt tocgaaggta ac:ggc::ca 6 4.-;0 

geagagegea gataccaaa: actgtccttc tagtgeagee gtagttaggc caccacttca 6540 

agaaccctgt agcaccgcct acatacctcg etctgetaat cotgttacca gtggetgetg 6 6 0 0 

ccagcggcga taagtcgtgt c:taccgggt tggactcaag acgatagcta ceggataagg 6660 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagectggag cgaacgacct 6720 

acaeegaaet gagataccta cagegtgage tatgagaaag cgeeaegett cccgaaggga 67 ; ; . 0 

gaaaggcgga caggtacccg gtaageggea gggteggaac aggagagege acgagggagc 6 £4 0 

ttccaggggg aaacgcctgg :atctttata g:cc"gtcgg g:::cgccac ctctgacttg 6000 

agegtegatt tttgtgatgc tegtcagggg ggeggagect atggaaaaac gccagcaacg 6 0 ^ 0 

cggcc::t:t acggttectg gec" tttget ggccttttgc tcacatgttc tttcctgcg: 7 v. 0 

tatecectga ttctgtgga: aacegtatta ccgcctttga gt gage t gat acegetegee 
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gcag:cgaac ga:cgagcgc agcgagccag tgagcgagga agcggaagag cgcctgatgc 71- 

ggratitititic t: -:c::acgcat ctgtgcggta r::cacaccg catatggtgc actctcagta 72 j j 

caa:c:gctc :ga:gccgca tagtcaagcc agtatacact ccgc:atcgc tacgtgactg 726 J 

ggtcarggcn gcgocccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 7 .3 2 J 

gctcccggca :ccgct:aca gacaagctgt gaccg:c:cc gggagctgca tgtgccagag 7 3 

gtt:tcaccg :caccaccga aacgcgcgag gcagcngcgg taaagctca: cagcgtggtic 744 j 

gtgaagcgat: "cacaga:gt c:gcc:gttc anocgcgticc agctcgttga gtctctccag 7600 

aagcgctaa: gtccggcttc tgataaagcg ggccatgtta agggcggt tr tttcctgttt 7560 

ggtcactcga tgcctccgtg taagggggaa ::tctgt:ca tgggggtaat gacaccgatg 762 0 

aaacgagaga ggatgctcac gatacgggtt actgatgatg aacatgcccg gttactggaa 76'60 

cgtngtgagg gtaaacaact ggeggtatgg acgcggcggg accagagaaa aaccactcag 7740 

ggtcaatgcc agcgcttcgt taatacagat gtagg:gttc cacagggtag ccagcagcat 78 jO 

cctgcgatgc agatccggaa ca:aa:ggtg cagggcgctg acttccgcgt ttccagacnt 7 360 

tacgaaacac ggaaaccgaa gacca:tcat gttgttgctc aggtcgcaga cgttttgcag 79.-. 0 

cagcagtcgc ctcacgttcg crcgcgtatc ggngattcac tctgctaacc agtaaggcaa 79-^0 

ccccgccagc ctagccgggt cctcaacgac aggagcaoga tcatgcgcac ccgtggccag 8 0 4 0 

gacccaacgc cgcccgagat gcgccgcgcg cggctgc:gg agatggogga cgcgatggat 3 Oi J 

atgntctgcc aagggttggt ttgcgca^tc acagt^cccc gcaagaattg attggctcca 3160 

attcntggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag gtcgaggtgg 32 2 0 

cccggctcca cgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta 32 30 

caatccatgc caacccgttc ca:gtgc:cg ccgaggcggc a:aaatcgcc gtgacgatca 304 j 

gcggtccagt gatcgaagtt aggctggcaa gagccgcgag cgatccttga agctgtccct 8400 

gatggtcgcc atctacctgc ccggacagca tggcctgcaa cgcgggcatc ccgatgccgc 346 0 

cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcggc ggccgcttcg 8 52 ) 

zcgaccgaaa cagcagttat aaggcatgaa gctgtccggt ttttgcaaaa gtggctgtga 853 J 

ctgtaaaaag aaatcgaaaa agaccgtttt gtgtgaaaac ggtctttttg tttcctttta 8640 

accaaczqcc ataa 8654 
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<223> Description of Artificial Sequence: Recombinant 
pAN423 plasmid

<400> 78 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 6 



1 1 



i 



aaagtggctg tgactgtaaa aagaaatcga aaaagaccgt tttgtgtgaa aacggtcttt 120 

ttaaccaact gccataactc gaggcctacc tagcttccaa gaaagatatc 1 

ctaacagcac aagagcggaa agatgttttg ttctacatcc agaacaacct ctgctaaaat 2- 

tcctgaaaaa ttttgcaaaa agttgttgac tttatctaca aggtgtggta taataatctt 3 

aacaacagca ggacgctcta gaaaaggagg aatttaaatg tatcgtacga tgatgagcgg 3 6 0 

caaacttcac agggcaactg ttacggaagc aaacctgaac tatgtgggaa gcattacaat 420 

tgatgaagat ctcattgatg ctgtgggaat gcttcctaat gaaaaagtac aaattgtgaa 430 

taataataat ggagcacgtc ttgaaacgta tattattcct ggtaaacggg gaagcggcgt 540 

catatgctta aacggtgcag ccgcacgcct tgtgcaggaa ggagataagg tcattattat 6 0'.) 

ttcctacaaa atgatgtctg atcaagaagc ggcaagccat gagccgaaag tggctgttct 660 

gaatgatcaa aacaaaattg aacaaatgct ggggaacgaa ccagcccgta caattttgta 720 

aaggatcctg ttttggcgga tgagagaaga ttttcagcct gatacagatt aaatcagaac 730 

gcagaagcgg tctgataaaa cagaatttgc ctggcggcag tagcgcggtg gtcccacctg 340 

accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc 900 

atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg 960 

gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1020 

ggagcggatt tgaacgttgc gaagcaacgg cccggagggt ggcgggcagg acgcccgcca 1030 

taaactgcca ggcatcaaat taagcagaag gccatcctga cggatggcct ttttgcgttt 1140 

ctacaaactc ttggtaccca gaaaaagcgg caaaagcggc tgttaaaaaa gcgaaatcga 1200 

agaagctgtc tgccgctaag acggaatatc aaaagcgttc tgctgttgtg tcatctttaa 1260 

aagtcacagc cgatgaatcc cagcaagatg tcctaaaata cttgaacacc cagaaagata 1320 

aaggaaatgc agaccaaatt cattcttatt atgtggtgaa cgggattgct gttcatgcct 13 30 

caaaagaggt tatggaaaaa gtggtgcagt ttcccgaagt ggaaaaggtg cttcctaatg 1440 

agaaacggca gctttttaag tcatcctccc catttaatat gaaaaaagca cagaaagcta 15)0 

LldddyCddC ICJdCy'-jLgLg gaaL.ggaaug Laja^L-aaa „ k_. y a l- y _ '~. ci da(iyv.L.^yy -> ^ 

cacttggata tgatggaact ggcacggttg ttgcgtccat tgataccggg gtggaatgga 1620 

atcatccggc attaaaagag aaatatcgcg gatataatcc ggaaaatcct aatgagcctg 16?0 

aaaatgaaat gaactggtat gatgccgtag caggcgaggc aagcccttat gatgatttgg 1740 
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ctcatggaac ccacgtgaca ggcacgatgg tgggctrtga acctgatgga acaaatcaaa 13jO 

tcggtgtag: acctggcgca aaatgga:tg ctgttaaagc gttctctgaa gatggcggca I?-"; 

ctgatgctga cattttggaa gctggtgaat gggtttragc accaaaggac gcggaaggaa l'?2'J 

atccccaccc ggaaatggct cctgatgntg rcaataactc atggggaggg ggctctggac 1 2 ■*■ J 

ttgatgaatg gtacagagac atggtcaatg cc tggcgttc ggccgatatt ttccct gage 2 04 j 

tttcagcggg gaataeggat ctctttattc ccggcgggcc tggttccatc geaaatcegg J 1 J 

caaactatcc agaatcgttt gcaactggag cgactgagaa ttccaattcc ccaeggagag 1:12 j 

aaaagaaaat cgctaatgtt gattactctg aacttc:gca tattcttgaa tttaaaaagg 222 J 

ctgaaagagt aaaagattgt gctgaaatat tagag:ataa acaaaatcgt gaaacaggcg 22 32 

aaagaaagtt gtatcgagtg tggttttgta aatccaggct ttgtccaatg tgcaactgga 1:3 1 
ggagagcaat gaaacatggc at:cagtcac aaaaggctgt tgctgaagtt attaaacaaa j 
agecaacagt ccgttggttg rtrctcacat raacagctaa aaatgtttat gatggcgaag j 

aattaaataa gagtttgtca ganatggctc aaggacnecg ccgaatgatg caacataaaa .:*'!:. J 

aaattaataa aaatcttgtt ggetttaegc gegcaacgga agrgacaara aa:aataaag 2 2 - 2 

ataattctta caatcagcac ar.gcatgcat tggtatgcgt ggaaccaact taetttaaga 2610 

anacagaaaa ceacgtgaat caaaaacaat ggattcaaet ttggaaaaag gcaargaaa: 22 [ j 

tagactatga tecaaaegta aaagttcaaa tgattcgacc gaaaaataaa ta:aaa:cgg 27': 0 

acatacaatc ggcaattgac gaaactgcaa aacatcctgt aaaggatacg gattttatga 2S2 0 

ccgatgatga agaaaagaat ctgaaacgtt tgectgaett ggaggaaggt ntacaccgra 2b 30 

aaaggttaa: ctcctacggt ggettgtcaa aagaaaeaca taaaaaatta aaccttgatg 1:912 

acacagaaga aggegattrg atccatacag atgacgacga aaaagecgat gaagatggat SO 22 

tttctattat tgcaatgtgg aattgggaac ggaaaaa:ta ttttattaaa gagtagtcca sO'iO 

acaaaeggge catattgttg tacaagtgat gaaatacega at:"aaaac: :ag::ta:a: 3122 

gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa tgaatg:aac 3130 

agggttcaat taaaagaggg aagegeatea tcaaccccat aaactacgcc tgccctcatt .•■14" 

aetggaggg: gaaatgtgaa tacatcctat tcacaatcga a:::acgaca caaccaaat: 22 j 

ttaatttggc titgca:ett atcttttett agcg:a::aa atgaaatggt ettgaaegtc 236 

tcateacceg atattgeaaa rgattttaat aaaccaccrg cgagtacaaa ctgggtgaac .: 4 2 

acagcctnta tgttaacctt etccattgga acagc:gtae atggaaagct: ^zczg^zca.3. ': 1 ^ 

ttaggcatca aaaggtract c c t a 1 1: z g g a at:a:aat.aa attigititcgg geeggtaatt: j. 5 4 



1 1 






gggtttgttg gccattcttt cttttcctta cttattatgg ctcgttttat tcaaggggct 3 3 ~' 3 

ggtgcagctg catttccagc actcgtaatg gttgtagttg cgcgctatat tccaaaggaa 3:'v j 

aataggggta aagcattrgg tcrtattgga :cga:ag:ag ccatgggaga aggagtcggt : ^ ^ ': 

ccagcgattg gtggaatgat agcccattat attcartggc cccatcttct ac:cat:cct 37- j 

acgataacaa ttatcactgt tccgtttctt atgaaatta: :aaagaaaga agtiaaggata 3 -4 J 

aaaggtcatt ttgatatcaa aggaattata ctaatgtctg caggcattgt acctttcatg 

tcgtttacaa catcata:ag catttctttt cttatcgtta gcgtgctgtc atccctgata 

tttgtaaaac atatcaggaa agtaacagat ccttttgttg atcccggatt agggaaaaat 40-0 

acacctttta tgattggagt tctttgtggg ggaattata: :tggaacagt agcagggttt 4v -'0 

gtctctatgg ttccttatat gatgaaagat gttcaccagc taagtactgc cgaaatcgga 4140 

agtgtaatta ttttccctgg aacaatgagt gtcattatt: "cggctacat tggtgggata 4100 

cctgttgata gaagaggccc tttatacgtg ctaaacatcg gagtcacatt tcnttctgtt 4 2 -JO 

agctttctaa ctgcttcctt tct:t:agaa acaacatca: gg:tcatgac aa:ta:aatc 43.-0 

gratttgttt taggtgggct ttcgttcacc aaaacagtta tatcaacaat tgc ttcaagt 43-0 

agcttgaaac agcaggaagc tggcgctgga atgagtttgc ::aac::tac cagcttctta 444 0 

tcagagggaa caggtattgc aatcgtaggt ggtttattac ccataccctt acmgatcaa 43 ;0 

aggttgttac ctatggaagt tgatcagtea acttatctgc atagcaattt gt:at:actt 4 0 r. 0 

t:ttcaggaa tcattgtcat: tagttggctg gttaccttga a:g"ata:aa acactctcaa 43 0 0 

aggga"tct aaatcgctaa gggatcaact ttgggagaga gttcaaaatt gatccttttt 4 0 '-: < j 

ttataacagt tcgaagcggc cgcaattctt gaagacgaaa gggcctcgtg acacgcctat 

ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acrttrcggg 40: 

gaaatgtgcg cggaaccccc attcgrttat ttttctaaat: aca:tcaaat acgtacccgc 4830 

tcatgagaca ataaccctga taaatgcttc aataaratrg aaaaaggaag agcatgagta 4 90 0 

t:caacatt: ccgtgtcgcc cttattccct ttttcgcggc a:::tgcctt cctgtctttg 49 : 0 

ctcacccaga aacgctggcg aaagtaaaag atigctgaaga tcagttgggt gcacgagtgg 5 04 0 

gttacatcga actggatccc aacagcgg:a agatccttga gagttttcgc cccgaagaac 3 100 

g:tttccaa: gatgagcact tttaaagttc tigctatgtgg cgcggtatta tcccgtattg 5 100 

acgccgygca ayagcaactc ggticgccgca :acactat:c : ragaatgac ttggttgagt 3 

actcaccagz cacagaaaag ca:ct:acgg atggcargac agtaagagaa ttatgcagtg 53-3 

ctgccaraac catgagtgat aacac:gcgg ccaac::ac: "ctgacaacg a:cggaggac 33 4 3 

cgaaggagct aaccgctttt ttgcacaaca tgggggarca zg^aacicac c:: gatcgtt 54 j 3 
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gggaaccgga gc tgaa t gaa gccataccaa acgacgagcg tgacaccacg atgcctgcag 5 4 v: 

caacggcaac aacgttgcgc aaactattaa ctggcgaac t acttactcta gc t ccccggc 5 5.; j 

aacaa: taa t agactggatg gaggcggata aagt cgcagg accacctccg c gc ccggccc 5 5 J 

ttccggctgg ctggtttatt gc tgataaat ctggagccgg tgagcgtggg tctcgcggta 564 j 

tcartgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 5 7 "i j 

ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagaraggt gcctcactga 570 j 

ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatctaaaac 530 j 

ttcattttta atctaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 5 3-:0 

tcccttaacg tgagttt teg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat 594 j 

ctccttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 60 jj 

taccagcggt ggcttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6 00 0 

gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 6 10 J 

acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 61-0 



ctgctgccag tggcgataag tcgrgtctta ccgggttgga c tcaagacga tagttaccgg 6 2 



ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6 3 00 

cgacc:acac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgctccccg 6 3 0 j 

aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 64_ j 

gggagcttcc agggggaaac gcctggtatc tttacagtcc tgtcgggttt cgccacctct 6 4-: 0 

gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca 654 j 

gcaacgcggc ctttttacgg ttcctggccc tttgctggcc ttttgctcac atgttctttc 660 1 

ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 666~: 

ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 670 0' 

tgatgcggta ttctctcctt acgcatctgc gcggtatttc acaccgcata tggtgcactc 67-;. 

tcagtacaat ctgctctgat gccgcatagt taagccagta tacactccgc tatcgctacg 6 84 0 

:gactgggtc atggctgcgc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc 69 DO 

:tgtc:gctc ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg 6960 

t cagaggt t t t caccgtca: caccgaaacg cgcgaggcag ctigcggtaaa gc t cat: cage 7 0 2 •[ 

gtggtcgtga agegattcac agatgtctgc c eg t tea tec gcgtccagc: cgttgagtt: 708"' 

cticcagaagc gtcaatgtcc ggc::ctga: aaagegggee atgttaaggg eggtttttte 7140 

e z g t t z g g t c acttgatgee z c c g z g z a a g ggggaa:::c :g::ca:ggg ggtaatgata 
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ccgatgaaac gagagaggat gctcacgata cgggttactg atgatgaaca tgcccggtta 726C 

ctggaacgtt gtgagggtaa acaactggcg gtatggatgc ggcgggac :a gagaaaaatc 7 2 

actcagggtc aatgccagcg ctccgttaat acagatgtag gtgttccaca gg^tagccag 7 i -; 

cagcatcctg cgatgcagat ccggaacata atzjgtgcagg gcgctgactt ccgcgtttcc 744 

agactt:acg aaacacggaa accgaagacc attcatgttg ttgctcag^t cgcagacgtt 7 5 . [ 

teg cage age agtegcttca cgttcgeteg eg tat egg tg attcattetg ctaaccagta 756 

aggcaaeeee gccagcctag ccgggtcctc aaegacagga gcaegateat gcgeacccgt 76^ 

ggecaggaec caaegctgcc egagatgege egegtgcgge tgctggagat ggeggacgcg 7 6-; 

atggatatgt tetgecaagg gttggtttgc gcattcacag ttctccgcaa gaattgattg 7740 

getccaatte ttggagtggt gaatcegtta gcgaggtgec gceggcttec atteaggtcg 7 3'jO 

aggtggcecg gctccatgca ccgcgacgca aegeggggag gcagaeaagg tatagggegg 7 8 6 0 

cgcctacaat ccatgccaac ccgttccatg tgctcgccga ggeggeataa atcgccgtga 79.': 0 

cgatcagegg tccagtgatc gaagttaggc tggtaagage cgegagegat ccttgaagct 79-10 

gtecctgatg gtegtcatet acctgectgg acagcatggc ctgeaacgeg ggeateccga 8 '040 

tgccgecgga agegagaaga atcataatgg ggaaggecat ccageetegc gtc 8 00 3 

<210> 79 
<211> 30 9 3
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Recombinant
pAN429 plasmid

<400> 79

g^'-'jgecget: tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 6 0 

aaagtggctg tgactgtaaa aagaaatcga aaaagaccg: tttgtgtgaa aacggtcttt 120 

ttgtttcctt ttaaccaact gccataacrc gaggcctacc tagcttccaa gaaagatatc 18\' 

ctaacagcac aagagcggaa agatgttttg ttctacatce agaacaacct ctgctaaaat 2 40 

tectgaaaaa ttttgcaaaa agttgttgac tttatctaca aggtgtggta taataatctt 3 

aacaacagca ggaegctcta gattagaaag gaggtttaat taatgtatcg tacgatgatg 3 6 

ageggcaaac ttcacagggc aactgttacg gaagcaaacc tgaactatgt gggaagcatt 42 

acaaz tgatg aagatctcat tgatgctgtg ggaatgette ctaatgaaaa agtacaaatt 48 

gtgaataata ataatggagc aegtcttgaa aegtatatta ttcctggtaa aeggggaage 54 

ggegtcatat gettaaaegg tgc agecgea cgccttgtgc aggaaggaga taaggtcatt 6 0 
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acaaaatgat gtctgatcaa gaagcggcaa gccatgagcc gaaagtggct 660 

gttctgaatg accaaaacaa aattgaacaa atgctgggga acgaaccagc ccgtacaact 720 

ttgnaaagga tcctgttttg gcggatgaga gaagattttc agcc:ga:ac agattaaat: 7-0 

agaacgcaga agcggtctga taaaacagaa tttgcctgg: ggcagtagcg cggtggtccc 84>! 

acc:gacccc a:gccgaact cagaa^tgaa acgc :gtago gccga:gg:a gtgtggggcc 90! 

tccccatgcg agagtaggga actgccaggc accaaataaa acgaaaggcc cagtcgaaag 9 00 

actgggcctt tcgttttatc tgttgt:ttgt cggtgaacgc tc:cc:gagc aggacaaarc 1 1 0 1 0. 

cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc 10 - :: 

cgccataaac tgccaggcat caaatcaagc agaaggccat cctgacggat ggcctttttg 114 

cgtc:ccaca aacccttggt acccagaaaa agcggcaaaa gcggccgcca aaaaagcgaa 12 0 

accgaagaag ctgtctgccg ctaagacgga atarcaaaag cgt:c^gctg ttg:gt:carc 12 6 

tt:taaaagcc acagccgang aatcccagca aga:g:cc:a aaa:act:ga acacccagaa 13 2 

agataaagga aatgcagacc aaattcattc ttarratgrg gngaacggga t^gctgttca 1 3 

tgcc:caaaa gaggttatgg aaaaagtggc gcagcctccc gaagcggaaa aggtgcttcc 144 

taatgagaaa cggcagcttt ttaagtcatc ctccccact:t aatangaaaa aagcacagaa 100 

agc:at:aaa gcaactgacg gtgtggaatg gaacgcagac caaaccgacg ccccaaaagc 100 

"gggcactn ggatatgatg gaacrggcac ggtrgc tgcg tccactgaca ccggggtgga 162 

a:ggaa:ca" ccggcattaa aagagaaata tcgcggacat &,?LZ<^^gga.<3.^ arcctaatga 1 6 

gccngaaaa: gaaatgaact ggtatgatgc cgtagcaggc gaggcaagcc cttatgatga 174 

tttggctcat ggaacccacg tgacaggcac gatggcgggc tctgaacccg atggaacaaa l->0 

ccaaatcggt gragcacctg gcgcaaaatg gattgctgtt aaagcgttot ccgaagatgg 1'"'6 

cggcactgat gctgacattt tggaagctgg tgaatgggtt L'z^gca.cca.a. aggacgcgga 192 

aggaaatccc cacccggaaa tggctcctga tgttgtcaat aactcatggg gagggggctc 19 0 

tggacttgat gaatggtaca gagacatggn caatgcctgg cgttcggccg a:act:tccc 204 

rgagt tttca gcggggaata cggatctctt tattcccggc gggcctggtt c:arcgcaaa 2 10 

:ccggcaaac tatccagaat cgtttgcaac tggagcgact gagaattcca attccccatg 2 1.0 

gagagaaaag aaaatcgcta a:gt:gatta ctttgaactt c:gcatattc t t gaat t taa 2 .L 2 

aaaggcngaa agagtaaaag aLLglgctga aatattagag tataaacaaa atrgtgaaac 2^? 

aggcgaaaga aagtitgtatc gagtgtggtt ttgtaaatcc aggctit tgtc caatg:gcaa 23 4 

crggaggaga gcaatgaaac a:ggcat:ca gcca-:aaaag gttgctgctg aagttattaa 24 j 

acaaaagcca acagttcgtt: ggttgt ttct cacattaaca gttaaaaatg :::a:gar.gg 2 4 6 
cgaagaanta aataagagtt tgtcagatat
taaaaaaatt aataaaaatc ttgttggttt 
taaagataat tcttataatc agcacatgca 
taagaataca gaaaactacg tgaatcaaaa 
gaaattagac tatgatccaa atgtaaaagt 
atcggatata caatcggcaa ttgacgaaac 
tatgaccgat gatgaagaaa agaatttgaa 
ccgtaaaagg ttaatctcct atggtggttt 
tgatgacaca gaagaaggcg atttgattca 
tggattttct attattgcaa tgtggaattg 
gttcaacaaa cgggccatat tgttgtataa 
tatatgtggt aaaatgtttt aatcaagttt 
gtaacagggt tcaattaaaa gagggaagcg 
tcattattgg agggtgaaat gtgaatacat 
aaattttaat ttggctttgc attttatctt 
acgtctcatt acctgatatt gcaaatgatt 
tgaacacagc ctttatgtta accttttcca 
atcaattagg catcaaaagg ttactcctat 
taattgggtt tgttggccat tctttctttt 
gggc tggtgc agctgcattt ccagcactcg 
aggaaaatag gggtaaagca tttggtctta 
tcggtccagc gattggtgga atgatagccc 
ttcctatgat aacaattatc actgttccgt 
ggataaaagg tcattttgat atcaaaggaa 
ttatgttgtt tacaacatca tatagcattt 
tgatatttgt aaaacatatc aggaaagtaa 
aaaatatacc ttttatgatt ggagttcttt 
ggtttgtctc tatggttcct tatatgatga 
tcggaagtg: aattattttc cctggaacaa 
ggatacttgt rgatagaaga ggtcctttar 
ggctcaagga tttcgccgaa tgatgcaata 2520
tatgcgtgca acggaagtga caataaataa 2580 
tgtattggta tgtgtggaac caacttattt 2 6 4 0 
acaatggatt caattttgga aaaaggcaat 27 j j 
tcaaatgatt cgaccgaaaa ataaatataa 276 J 
tgcaaaatat cctgtaaagg atacggattt 28 JO 
acgtttgtct gatttggagg aaggtttaca 2880 
gttaaaagaa atacataaaa aattaaacct 2940 
tacagatgat gacgaaaaag ccgatgaaga 3 0v0 
ggaacggaaa aattatttta ttaaagagta 30-00 
gtgatgaaat actgaattta aaacttagtt 3220 
aggaggaatt aattatgaag tgtaatgaat 3180 
tatcattaac cctataaact acgtctgccc 3240 
cctattcaca atcgaattta cgacacaacc 3 3 00 
tttttagcgt attaaatgaa atggttttga 3360 
ttaataaacc acctgcgagt acaaactggg 3420 
ttggaacagc tgtatatgga aagctatctg 3480 
ttggaattat aataaattgt ttcgggtcgg 3540 
ccttacttat tatggctcgt tttattcaag 3600 
taatggttgt agttgcgcgc tatattccaa 3660 
ttggatcgat agtagccatg ggagaaggag 3720 
attatattca ttggtcctat cttctactca 3780 
ttcttatgaa attattaaag aaagaagtaa 3340 
ttatactaat gtctgtaggc attgtatttt 3 9 >0 
cttttcttat cgttagcgtg ctgtcattcc 396 3 
cagatccttt tgttgatccc ggattaggga 4j2D 
gtgggggaat tatatttgga acagtagcag 43- j 
aagatgttca ccagctaagt actgccgaaa 41 4 J 
tgagtgtcat tattttcggc tacartggtg 42:3 
acgtgttaaa catcggag:t acatttcttt 4260 
ctgctagctt tttaactgct tcctttcttt tagaaacaac atcatggntic atgacaatta 4320

taatcgtatt tgttctaggt gggct t tcgt tcaccaaaac ag::a:acca acaattgctt 43 1 

caagtagctt gaaacagcag gaagctggtg ctggaatgag t::gc:taac tccaccagct 444 ~ 

ttctatcaga gggaacaggt a::gcaa:tg taggtggt t i actacccata ccct:act:g 45i !■ 

atcaaaggtt gttacctatg gaagttgacc agtcaactta tcrgtatagt aatttgttat 451"" 

tac:-:t:ttc aggaatcat: g:ca:tagct ggctggtitac cctgaatgta tataaacatt 462 ; 

ctcaaaggga cttctaaatc gttaagggat caactttggg agagagttca aaattgatcc 46. : "■ 

tttttttaca acagttcgaa gcggccgoaa ctcttgaaga cgaaagggcc tcgtgacacg 474 0 

cctattttra taggttaatg tcatgataat aatggtttct tagacgtcag gcggcacttt 4 S 0 0 

tcggggaaat gtgcgcggaa cccccatttg tttatttttc caaacacatt caaatatgca 4H60 

tccgctcacg agacaa:aac cctgataaat gcttcaataa :attgaaaaa ggaagagtat 4^1': 

gagtattcaa catttccgtg tcgccctcat tcccttnttt gcggcatttt gccttcctgt 4 ■-■ j 

ttctgctcac ccagaaacgc tggtgaaagt aaaaga'gcc gaagatcagt tgggtgcacg 504 j 

agtgggtitac atcgaactgg accrcaacag cggtaagatc cttgagagtt ttcgccccga 5.1. '"'0 

agaacgtttt ccaatgacga gcaccttcaa agttctgcta rgtggcgcgg tattatcccg 5160 

ta^cgacgcc gggcaagagc aact :ggccg ccgcatacac :att-::caga acgacttggt 51.21 

tgagcactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 51..0; 

cagtgctgcc acaaccacga gtgacaacac tgcggccaac ctacctctga caacgatcgg 5 3 4 0 

aggaccgaag gagctaaccg cttttttgca caacatgggg gatcangtaa ctcgcctrga 54 0 0 

tcgctgggaa ccggagctga angaagccat accaaacgac gagcgtgaca ccacgacgcc 546 1 

tgcagcaatig gcaacaacgc tgcgcaaact attaaccggc gaactactta ctctagcttc 500;"' 

ccggcaacaa t:aatagact ggatggaggc ggataaagtc gcaggaccac ttctgcgccc 55~ 1 >1 

ggcccttccg gctggctggt ttattgctga taaatctgga gccggrgagc gcgggtcccg 564 '. 

cggtatcatt gcagcactgg ggccagacgg taagccctcc cgtaccgtag ttatctacac 5 '7 0 0 

gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 57 Or; 1 

actgattaag cattggtaac tgtcagacca agtttactca :ata:acttt agattgat:: 5 2 ~: 

aaaacttcat ::ttaat:ta aaaggatcra ggtgaagat: c:tt::gata a:ctca:gac 5 : . ^ 1 

caaaatccct taaogcgag: z z ucg t toja ctgagcgtca gacc:cgtag aaaaga:caa 5v4 1 

agga:c:tct tgagat cc t ti t t:t:t tctgcg cgtaatctg: tgctitgcaaa caaaaaaacc 6 1 J 1 

ac:gc:acca g:ggtgg:t" gtttgccgga tcaagagcta ccaactic t: t ti ::ccgaagj: 

aartiggctitic ageagageg: agaiaccaaa tiactigtcctrti ciaglglagc cgiagila_^g 610 
eeaceaette aagaactctg tagcaccgcc tacatacctc gct:tg:taa tcctgttacc 6180

agtggetget gccagtggcg ataagt:g:g icctaccggg ctggactcaa gacgatagtt 624 

accgga:aag gcgcagcggt cgggetgaae ggggggtteg cgcacacagc ccagcttgga 63 J J 

gcgaacgacc tacaccgaac tgagatacc: acagcgtgag ctatgagaaa gcgccacgct 636 j 

teeegaaggg agaaaggcgg acaggta:cc ggtaagegge agggtcggaa caggagagcg 6 42 j 

cacgagggag ct:ccagggg gaaacgcctg gta:cttta: ag:c-::gtcg gg:t:cgcca 64 3 j 

cccc:gacc: gagcgtcgat ttrcg:ga:g ctcgtcaggg gggcggagcc :a:ggaaaaa 6 64 0 

cgccagcaac gcggcct" tt tacgg:tcc: ggcctcttgc :ggcc:tc:g ctcacatgtt 6600 

ettteetgeg ttatcccctg atnctgtgga taaccgta:: accgcctc:g agtgagctga 666 J 

:accgc:cgc cgcagccgaa cgaccgagcg cagcgagcca gtgagcgagg aagcggaaga 6 7 2 0 

gcgcctgatg cggtattttc tcc:tacgca tctgtgeggt at:ccacacc gcatatggtg 67 30 

cactctcagt acaatctgct ctgatgccgc atagttaagc eagtataeac cccgctatcg 6.340 

ctacgcgact gggtcatggc tgcgccccga ca<z<2cgcca.a. cacccgccga cgcgccctga 69 JO 

cgggctngcc tgctcccggc atccgcccac agacaagccg tgaccg:ccc egggagetge 60 60 

a:g:gc :aga ggttttcacc gtca:caccg aaacgcgcga ggcagctgcg gtaaagctca 702 0 

"cagcgtggt cgcgaagcga ctcacagatg tccgcctgtt catccgcgtc cagc:cgt:g 7 0-10 

ag::ccccca gaagcgtcaa tgtctggc:: ctgataaagc gggcca:gct aagggcggtt 7140 

ttttcctgtt tggtcacttg atgcctccg: gtaaggggga atc:c:gttc atgggggtaa 72 JO 

"gacaccgat gaaacgagag aggatgetea egataegggt taetgatgat gaacatgeee 12 60 

ggttaetgga acgttgtgag ggtaaaeaae tggeggtatg gatgeggegg gaccagagaa 73 2 0 

aaazcaczca. gggtcaatgc eagegetteg ttaatacaga tgtaggtgtt ccacagggta 73 30 

gccagcagca tectgegatg cagateegga acataatggt geagggeget gaettcegeg 744 j 

"teeagaet ttacgaaaea eggaaaeega agaeeattea tgttgttgct eaggtcgeag 7SJ0 

aegttttgea gcagcagteg etteaegtte getegegtat eggtgactea ttetgctaae 7 560 

eagtaaggea aceccgeeag eetageeggg teeteaaega eaggageaeg ateatgegea 7620 

eeegtggcea ggacccaaeg etgeeegaga tgegeegegt geggetgetg gagatggegg 7 6 ^ ~ 

aegegatgga tatgttetgc eaagggttgg tttgegeatt eaeagttete egeaagaatt 774 C 

ga:.:gg-:tcc aattcttgga gtggtgaate cgttagegag gtgeegeegg etteeattea 7 8 "i C 

gg:cgagg:g gcccggetec atgeaeegeg aegeaaegeg gggaggcaga eaaggtatag 

ggeggegcet acaatccatg eeaaeeegtt cea tgtgete geegaggegg cataaatege 
cgtgacgatc agcggtccag tgatcgaagt taggctggta agagccgcga gcgatccttg 7980
aagctgtccc tgatggtcgt catctacctg cctggacagc atggcctgca acgcgggcat 8040
cccgatgccg ccggaagcga gaagaaccat aatggggaag gccatccagc ctcgcgtc 8098

<210> 80
<211> 4450 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN443 plasmid 

<400> 80
ccatcgaatg gccagatgat taattcctaa tttttgttga cactctatca ttgatagagt 60 

tattttacca ctccctatca gtgatagaga aaagtgaaat gaatagttcg acaaaaatct 120

agattagaaa ggaggattta aatatgagac agattactga tatttcacag ctgaaagaag 180 

ccataaaaca ataccattca gagggcaagt caatcggatt tgttccgacg atggggtttc 240 

tgcatgaggg gcatttaacc ttagcagaca aagcaagaca agaaaacgac gccgttatta 300 

tgagtatttt tgtgaatcct gcacaattcg gccctaatga agattttgaa gcatatccgc 360

gcgatattga gcgggatgca gctcttgcag aaaacgccgg agtcgatatt ctttttacgc 420

cagatgctca tgatatgtat cccggtgaaa agaatgtcac gattcatgta gaaagacgca 480

cagacgtgtt atgcgggcgc tcaagagaag gacattttga cggggtcgcg atcgtactga 540 

cgaagctttt caatctagtc aagccgactc gtgcctattt cggtttaaaa gatgcgcagc 600 

aggtagctgt tgttgatggg ttaatcagcg acttcttcat ggatattgaa ttggttcctg 660 

tcgatacggt cagagaggaa gacggcttag ccaaaagctc tcgcaatgta tacttaacag 720 

ctgaggaaag aaaagaagcg cctaagctgt atcgggccct tcaaacaagt gcggaacttg 780 

tccaagocgg tgaaagagat cctgaagcgg tgataaaagc tgcaaaagat atcattgaaa 840 

cgactagcgg aaccatagac tatgtagagc tttattccta tccggaactc gagcctgtga 900 

atgaaattgc tggaaagatg attctcgctg ttgcagttgc tttttcaaaa gcgcgtttaa 960 

tagataatat cattattgat attcgtagaa aggaggtgaa ttaatatgta tcgtacgatg 1020 

atgagcggca aacttcacag ggcaactgtt acggaagcaa acctgaacta tgtgggaagc 1080

attacaattg atgaagatct cattgatgct gtgggaatgc ttcctaatga aaaagtacaa 1140 

attgtgaata ataataatgg agcacgtctt gaaacgtata ttattcctgg taaacgggga 1200

agcggcgtca tatgcttaaa cggtgcagcc gcacgccttg tgcaggaagg agataaggtc 1260

attattattt cctacaaaat gatgtctgat caagaagcgg caagccatga gccgaaagtg 1320 
gctgttctga atgatcaaaa caaaattgaa caaatgctgg ggaacgaacc agcccgtaca 1380
attttgtaaa ggatcccccg gggatccctc gaggtcgacc tgcaggggga ccatggtctc 1440
agcgcttgga gccacccgca gttcgaaaaa taataagctc gacccgtgaa gtgaaaaatg 1500
gcgcacattg tgcgacattt tttttgtctg ccgtttaccg ctactgcgtc acggacctcc 1560
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 1620
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc cttctcgcca 1680
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 1740
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 1800
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 1860
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 1920
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 1980
acgcgaattt taacaaaata ttaacgctta caatttcagg tggcactttt cggggaaatg 2040
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 2100
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 2160
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 2220
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 2280
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 2340
caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg 2400
ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 2460
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 2520
taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 2580
agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 2640
cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 2700
caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 2760
tgatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 2820
ctggctggtt tattgctgat aaatctggag ccggtgagcg tggctctcgc ggtatcattg 2880
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 2940
aggcaactat ggatgaacga aatagacaga tcycLyagaL "y^y^-" - — d => ~ - ' 

attggtagga attaatgatg tctcgtttag ataaaagtaa agtgattaac agcgcattag 3060
agctgcttaa tgaggtcgga atcgaaggtt taacaacccg taaactcgcc cagaagctag 3120
gtgtagagca gcctacattg tattggcatg taaaaaataa gcgggctttg ctcgacgcct 3180
tagccattga gatgttagat aggcaccata ctcacttttg ccctttagaa ggggaaagct 3240
ggcaagattt tttacgtaat aacgctaaaa gttttagatg tgctttacta agtcatcgcg 3300
atggagcaaa agtacattta ggtacacggc ctacagaaaa acagtatgaa actctcgaaa 3360
atcaattagc ctttttatgc caacaaggtt tttcactaga gaatgcatta tatgcactca 3420
gcgcagtggg gcattttact ttaggttgcg tattggaaga tcaagagcat caagtcgcta 3480
aagaagaaag ggaaacacct actactgata gtatgccgcc attattacga caagctatcg 3540
aattatttga tcaccaaggt gcagagccag ccttcttatt cggccttgaa ttgatcatat 3600
gcggattaga aaaacaactt aaatgtgaaa gtgggtctta aaagcagcat aacctttttc 3660
cgtgatggta acttcactag tttaaaagga tctaggtgaa gatccttttt gataatctca 3720
tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga 3780
tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 3840
aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 3900
aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt 3960
taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt 4020
taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat 4080
agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct 4140
tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca 4200
cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4260
agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 4320
gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga 4380
aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca 4440
tgacccgaca 4450

<210> 81 
<211> 10212 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN251 plasmid 

<400> 81

gcggccgcta aaaagagctt gaggatttgc ggagtgaaaa tcagacattg cggaatcagc 60

tagagatgac agaagaggat tacaaggcac tgatcgatat catggatcgg gccagaaaaa 120

tggttgtttc gaaggaagac ggaagaatga aaaaagcggc tcaagaaacg taaagaaacg 180
cctgaaatga accggcccta tagtaagaat aggccggttg ttttgatttc tatgcagact 240
ctcccggtgt catttcgcga tccatarcag gatgccagat gagcgggtct tcccctttgt 300 
cccgcgccat atcatactta acagttttaa agttcatttt gttccaaaat tccgctgatt 360 
tcar:ctcgg atttgtccgg atcggcattt tgaatgattt tgcaaattca accaaggctc 4^0 
tcccgtatcc cctgttctgg tagcccggaa gaacctcaag cttccacagc tccaaataat 4--- .'• 
cctggcggtt gtcaaaatag ggattcgatt tgccgttaac ttgatacaga ctcattcgtg 5 4 ■■ 
ctacaagttt atcgccaaaa taaatcccgt aaaaaggcga ggtgctgtca ttttcaataa 6 30 
tattatcctg aagttcttca agcattgaaa gctcctgaat gccgtattct ttgaatttct 6 6 
tgaattcttc cagcgtttta tagttgataa gcagacgttc tacctttgtc aaacaaatct 7 0 
ccccctttgt tgtttctaca tatattgtaa acgctttatt taaaaaatcc aaatatt taa 7 0 
actttaattt taagcacatg ggatctttga gaagtaattt cttcttactt ctgctatgat 34 
aatacgtaaa tgcgtcgacc gaaacagcag ttataaggca tgaagctgtc cggtttttgc 9 0 
aaaagtggct gtgactgtaa aaagaaatcg aaaaagaccg ttttgtgtga aaacggtctt 9 6 
tttgtttcct tttaaccaac tgccataact cgaggcctac ctagcttcca agaaagatat 1000 
cc taacagca caagagcgga aagatgt tt t gttctacatc cagaacaacc tctgctaaaa 1;j 3 0 
ttcctgaaaa attttgcaaa aagttgttga ctttatctac aaggtgtggt ataataatct 1340 
taacaacagc aggacgctct agaggaggag acatcatgaa aattggaatt atcggcggag 1 0 0 0 
gctccgttgg tcttttatgc gcctattatt tgtcacttta tcacgacgtg actgttgtga 1060 
cgaggcggca agaacaggct gcggccattc agtctgaagg aatccggctt tataaaggcg 1300 
gggaggaatt cagggctgat tgcagtgcgg acacgagtat caattcggac tttgacctgc 1330 
ttgtcgtgac agtgaagcag catcagcttc aatctgtttt ttcgtcgctt gaacgaatcg 1440 
ggaagacgaa tatattattt ttgcaaaacg gcatggggca tatccacgac ctaaaagact 150 3 
ggcacgttgg ccattccatt tatgttggaa tcgttgagca cggagctgta agaaaatcgg 1560 
atacagctgt tgatcataca ggcctaggtg cgataaaatg gagcgcgttc gacgatgctg 1600 
aaccagaccg gctgaacatc ttgtttcagc ataaccattc ggattttccg atttattatg 1630 
aga:ggattg gtaccgtctg ctgacgggca agctgattgt aaatgcgtgt attaatcctt 174 3 
taaccgcgtt attgcaagtg aaaaatggag aactgctgac aacgccagct tatctggctt 13 0 0 
ttatgaagct ggtatttcag gaggcatgcc gcattttaaa acttgaaaat gaagaaaagg 136 3 
cttgggagcg ggttcaggcc gtttgcgggc aaacgaaaga gaatcgtnca "caatgctgg 132 3 
ttgacgtcat tggaggccgg cagacggaag ctgacgccat tatcggatac ttattgaagg 103 3 
aagcaagtct tcaaggtctt gatgccgtcc acctagagtt tttatatggc agcatcaaag 2040

cattggagcg aaatacaaac aaagtctttt gagctttttc ggtaacatgc tatactcatt 2 1 0 J 

tcggatcact aactatttat tggagaaagg aagttctaga agatgcagct aactgaactt 2160 

tccancaaaa atcagaatgt gtttgtacag cactatatag atggcaaaga agaaatgtct 222") 

tctttttttg attacagtat tcatcataag gacatgtggc gcgaaagact ggaagactta 22 S J 

tcttcccggt ttttcgcaag agaggaattg gcggcgtact taacccctta ccataataaa 2 3 40 

ttcggttcaa gtgcgatgca gtctgctatt gagaagctga aggacccgtc aagtgccgct 2400 

gtagtcggcg gacagcaggc aggactttta acaggaccgc tttacaccat acataaaatc 2460 

at:tcaatca ttgttttagc aaagcaacaa gaaaaggaac tgcaagtgcc tgtcatacca 2520 

atctcctggg tggctggaga agaccacgat ttggatgaga ttaattttgt tcacacatct 2530 

gaagagaatg ggcctgtgaa aaaaaagctg ce rcagtc t t attggaagaa atcatcagca 2640 

gcgagtacat cgcttgatca ggaaaagtgt gccgcgtgga tagatgatgt ttttgccgct 27 00 

tttgaagaaa cagaccatac gaatacactt ctcgacaatg tgaaacgatg tttaagggaa 27 60 

tctgttacgt ttactgactt ctttgaactg ctgatcgcgg atttgttcca agaagagggc 2320 

ttagttctat taaattctgg ggatcctgtt tcggcggatg agagaagatt ttcagcctga 2 3 30 

tacagattaa atcagaacgc agaagcggtc tgataaaaca gaatttgcct ggcggcagta 2 9 4 J 

gcgcggcggt cccacctgac cccatgccga acrcagaagt gaaacgccgt agcgccgatg 3 0 0 0 

gtagngtggg gtctccccat gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag 3-060 

gctcagtcga aagactgggc ctttcgtttt atctgttgtt tgtcggtgaa cgctctcctg 3 120 

agtaggacaa atccgccggg agcggatttg aacgttgcga agcaacggcc cggagggtgg 3 180 

cgggcaggac gcccgccata aactgccagg catcaaatta agcagaaggc catcctgacg 3 2 40 

gatggccttt ttgcgtttct acaaactctt ggtacccaga aaaagcggca aaagcggctg 33 00 

ttaaaaaagc gaaatcgaag aagctgtctg ccgctaagac ggaatatcaa aagcgttctg '}^60 

ctgttgtgcc atctttaaaa gtcacagccg atgaatccca gcaagatgtc ctaaaatact 3420 

tgaacaccca gaaagataaa ggaaatgcag accaaattca ttct tattat gtggtgaacg 34 SO 

ggattgctgt tcatgcctca aaagaggtta tggaaaaagt ggtgcagttt cccgaagtgg 3540 

aaaaggtg-rt tcctaatgag aaacggcagc tttttaagtc atcctcccca tttaatatga 36 DO 

aaaaagcaca gaaagctart aaagcaacrg acggtgtgga atggaatgta gaccaaatcg "^6-j 

atgccccaaa agcttgggca cttggata'g atggaactgg cacgg^tgtt gcgtccattg 3 ~" 2 0 

ataccggggt ggaatggaat catccggca: taaaagagaa atatcgcgga tataatccgg 3 "8 0 

aaaatcctaa tgagcctgaa aatgaaatga actggtatga tgccgtagca ggcgaggcaa *-43' 
gcccttatga tgatttggct catggaaccc acgtgacagg cacgatg^tg ggc^ctgaac 3900

ctgatggaac aaatcaaatc ggtgtagcac ctggcgcaaa argga^tgcr gttaaagcgt 3 6 3 

tctctgaaga tggcggcact gatgctgaca ttttggaagc tggtgaatgg gttttagcac 4 jj 3 

caaaggacgc ggaaggaaat ccccacccgg aaatggctcc tgatgttgtc aataactcat 4 .> -i 3 

ggggsggggg ctctggactt gatgaatggt acagagaca: ggtcaatgcc tggcgttcgg 

ccgatanttt ccctgagttt tcagcgggga atacggatc: ctttattccc ggcgggcctg 43 3 3 

gttctatcgc aaatccggca aactatccag aatcgtttgc aactggagcg actgagaatt 436 3 

ccaattcccc atggagagaa aagaaaatcg ctaatgttga ttactttgaa cttctgcata 4:3 3 

ttcttgaatt taaaaaggct gaaagagtaa aagattgtgc tgaaatatta gagtataaac 4.'. 

aaaatcgtga aacaggcgaa agaaagttgt atcgagtgtg gttttgtaaa tccaggcttt 4440 

gtccaatgtg caactggagg agagcaatga aacatggcat tcagtcacaa aaggttgttg 4 1 j j 3 

ctgaagttat taaacaaaag ccaacagttc gttggttgtt tctcacatta acagttaaaa 4 3 6 0 

atgtttatga tggcgaagaa ttaaataaga gtttgtcaga tatggctcaa ggatttcgcc 4630 

gaatgatgca atataaaaaa attaataaaa atcttgttgg ttttatgcgt gcaacggaag 4": '--0 

tgacaataaa taataaagat aattct:ata atcagcacat gcatgtattg gtatgtgtgg 4740 

aaccaactta ttttaagaat acagaaaact acgtgaatca aaaacaatgg attcaatttt 4-; ,_ 0 

ggaaaaaggc aatgaaatta gactatgatc caaatgtaaa agttcaaatg attcgaccga 4^6 3 

aaaataaata taaatcggat atacaatcgg caattgacga aactgcaaaa tatcctgtaa 4930 

aggatacgga ttttatgacc gatgatgaag aaaagaattt gaaacgtttg tctgatttgg 4'r^O 

aggaaggttt acaccgtaaa aggttaatct cctatggtgg tttgttaaaa gaaatacata ^040 

aaaaattaaa ccttgatgac acagaagaag gcgatttgat :catacagat gatgacgaaa liilCO 

aagccgatga agatggattt tctattattg caatgtggaa ttgggaacgg aaaaattatt bltfO 

ttattaaaga gtagttcaac aaacgggcca tattgttgta caagtgatga aatactgaat 3330 

ttaaaactta gtttatangt ggtaaaatgt tttaatcaag tttaggagga attaattatg 30P.0 

aagtgtaatg aatgtaacag ggttcaatta aaagagggaa gcgtatcatt aaccctataa c >340 

actacgtctg ccct.ca.tz&^ tggagggtga aatgtgaata catcctattc acaatcgaat >i ;'' 0 

ttacgacaca accaaatttt aatttggctt tgcattttat ctttttttag cgtattaaat 3460 

gaaatggtrt tgaacgtctc attacctgat attgcaaatg attttaataa accacctgcg 3 : : 2 0 

agtacaaact gggtgaacac agcct::atg ttaaccttit ccattggaac agctgtatat 36^0 

ggaaagctat ctgalcaar.: aggcatcaaa aggttactcc tatttggaa: ta:aa:aaa: 3 643 
tgtttcgggt cggtaattgg gtttgttggc
cgttttattc saggggctgg tgcagc tgca 
cgctatattc caaaggaaaa taggggtaaa 
atgggagaag gagtcggtcc agcgattggt 
tatcttctac tcattcctat gataacaatt 
aagaaagaag taaggataaa aggtca 1 1 1 1 
ggcattgtat tttttatgtt gtttacaaca 
gtgctgtcat tcctgatatt tgtaaaacat 
cccggattag ggaaaaatat accttttatg 
ggaacagtag cagggtttgt ctctatggtt 
agtactgccg aaatcggaag tgtaattatt 
ggctacattg gtgggatact tgttgataga 
gttacatttc tttctgttag ctttttaact 
ttcatgacaa ttataatcgt atttgtttta 
t caacaattg 1 1 tcaagtag c t tgaaacag 
aactttacca gctttttatc agagggaaca 
atacccttac ttgatcaaag gttgttacct 
agtaatttgt tattactttt ttcaggaatc 
gtata taaac at tc tcaaag gga 1 1 tc taa 
tcaaaattga tccttttttt ataacagt tc 
gcctcgtgat acgcctattt ttataggtta 
caggtggcac ttttcgggga aatgtgcgcg 
attcaaatat gtatccgctc atgagacaat 
aaaggaagag tatgagtatt caacatttcc 
tttgccttcc tgtttttgct cacccagaaa 
agt tgggtgc acgagtgggt tacatcgaac 
gttttcgc cc cgaagaacgt tttccaatga 
cggtattatc ccqtattqac qccgggcaag 
agaatgactt ggttgagtac tcaccagtca 
taagagaatt atgcagtgct gccataacca 
tgacaacgat cggaggaccg aaggagctaa 







cattctttct tttccttact tattatggct 5700 
tttccagcac tcgtaatggt tgtagttgcg 576 3 
gcatttggtc ttattggatc gatagtagcc 530 3 
ggaatgatag cccattatat tcattggtcc 5 3-') 
atcactgttc cgtttcttat gaaattatta 594] 
gatatcaaag gaattatact aatgtctgta 6 3 -0 0 
tcatatagca tttcttttct tatcgttagc 6000 
atcaggaaag taacagatcc ttttgttgat 6100 
attggagttc tttgtggggg aattatattt 
ccttatatga tgaaagatgt tcaccagcta 6240 
ttccctggaa caatgagtgt cattattttc 63 00 
agaggtcctt tatacgtgtt aaacatcgga 6360 
gc ttcctttc tt t tagaaac aacatcatgg 6 4 0 0 
ggtgggcttt cgttcaccaa aacagttata 6400 
caggaagctg gtgctggaat gagtttgctt 6540 
ggtat tgcaa t tgtaggtgg tt tat tat cc 6 6 0 0 
a tggaagt tg a tcagtcaac t tatc tgtat 6 6 6 0 
attgtcatta gttggctggt taccttgaat 670 0 
atcgttaagg gatcaacttt gggagagagt 67s0 
gaagcggccg caattct tga agacgaaagg 6 34 0 
atgtcatgat aataa tggt t tc t tagacgt 6 9 0 0 
gaacccctat ttgtttattt ttctaaatac 696 0 
aaccctgata aatgcttcaa taatattgaa 7000 
gtgtcgccct tattcccttt tttgcggcat 70*0 
cgctggtgaa agtaaaagat gctgaagatc 7140 
tggatctcaa cagcggtaag a tec 1 1 gaga 7 2 0 0 
tgagcacttt taaagttctg ctatgtggcg 7160 
ageaactegg tcgccgcata cactattctc 7 2 0 
cagaaaagca tc t tacgga t ggca tgacag 7 2 R 2 
tgagtgataa cactgcggcc aacttacttc 7441 
ccgctttttt gcacaacatg gggga t ca tg 7 2 0 j 
taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 7560

acaccacgat gcctgcagca atggeaacaa cgttgcgcaa actattaact ggcgaactae 7 

t:ac:ctagc ttcccggcaa caat:aatag actggatgga ggeggataaa gttgcaggac 76- j 

cac::ctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 7740 

agegtgggte tcgcggtatc attgcagcac tggggccaga tggtaagccc :c:cg:atcg 7-1 : j 

tag::atcta cacgacgggg agtcaggcaa ctatggatga acgaaacaga cagatcgctg 7 8'-" 0 

agacaggcgc cteactgatt aagcattggt aactgccaga ccaagt:tac tcatatatac 79^0 

t::agatcga ttcaaaactt cattcctaat ttaaaaggat ctaggtgaag atcct^tttg 79 

a:aa:ctca: gaccaaaatc ccttaacgtg agttttegtt ccaccgagcg ccagaccccg 3 0 4 0 

tagaaaaga: caaaggatct: tcttgagatc ctttttctct gcgcgtaatc :gctgcttgc 8 190 

aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggaieaagag ctaccaactc 81r'0 

tc:ttccgaa gg:aac:ggc ttcagcagag cgcagacacc aaa:accgcc ctnctagtgt 8110 

age eg tag tt aggecaeeae ttcaagaact ctgtageaee gcctacatac etegctctge 81-0 

taatcctgcc aceagtggct getgeeagtg gcgataagtc gtgcettaec gggttggact 8110 

eaagacgata gttaceggat aaggegcage ggtcgggetg aacggggggt negtgeaeae 3 4 0 0 

ageeeagett ggagcgaacg aectaeaeeg aactgagata cetaeagegt gagctatgag 34 6 j 

aaagcgccac gcctcccgaa gggagaaagg eggacaggta teeggtaage ggeagggtcg 8^:0 

gaaeaggaga gegcaegagg gagcttccag ggggaaaege ctggtatett tatagtcetg 35 80 

tegggttteg ccacctetga ettgagegte gatttttgtg atgetegtea ggggggcgga 8 6 4 0 

geetatggaa aaacgeeage aacgeggeet ttttaeggtt cetggeettt tgetggcett 87 0 0 

ttgetcaeat gttctttect gegttateee ctgattetgt ggataaeegt attacegect 8760 

ttgagtgage tgataeegct cgcegeagee gaacgaeega gegeagegag tcagtgagcg 88.":) 

aggaagegga agagegeetg atgeggtatt ttctcettae geatetgtgc ggtattteae 3 3-80 

aeegeatatg gtgeaetete agtaeaatet getctgatge cgcatagcca agecagtata 8940 

eaetecgeta tegctaegtg aetgggteat ggctgcgccc cgaeaecegc caacacccgc 9000 

tgaegcgeee tgaegggett gtctgeteee ggcateeget tacagacaag etgtgacegt 906 0 

eteegggage tgeatgtgte agaggtttte acegteatea cegaaaegeg egaggeagct 911 j 

geggtaaage tcatcagegt ggtcgtgaag cgatteaeag atgtetgeet gttcatcegc 9180 

gtceagcteg ttgagtttct ceagaagegt taatgtetgg cttetgataa agegggceat 92 4 1 " 

gttaagggeg gttttttect gtttggtcae ttgatgeete cgtgtaaggg ggaatttetg 93 0 j 
1 t :a tggggg taatgatacc ga tgaaacg a gagagga tgc tcacgatacg ggrtactgat 9360

gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg 942 j 

cgggaccaga gaaaaatcac tcagggteaa tgccagcgct tcgttaatac agatgtaggt 94 3 j 

gttccacagg gtagrcagca gcatcctgcg acgcagatcc ggaacataat ggtgcagggc 9 54 j 

gctgacttcc gcgt:tccag actttacgaa acacggaaac cgaagaccat tcatgttgtt 960 J 

gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat 9 6 6 0 

t::a::ctgc t aaccagtaag gcaaccccgc cagcctagcc gggtcc tcaa cgacaggagc 9 71 0 

acga tea tgc gcacccg tgg ccaggaccca acgctgcccg agatgegecg cgtgcggc tg 9 7 3 - j 

ctggagatgg eggacgegat ggatatgttc tgccaagggt tggtttgcgc attcacagtt 9340 

cteegcaaga attgattggc tccaattctt ggagtggtga ateegttage gaggtgeege 9900 

cggcttecac tcaggtcgag gtggcccggc tccatgcacc gcgacgcaac geggggagge 9 960 

agacaaggca tagggeggeg cctacaatcc atgccaaccc gt tcca tg tg c tegecgagg 1 0 0 2 0 

eggcataaat cgccgtgacg ateageggtc cagtgatega agttaggctg gtaagagccg 10 J-0 

cgagcgatcc ttgaagctgt ccctgatggt cgtcatctac ctgcctggac ageatggect 1014 0 

geaacgeggg catcccgatg ccgccggaag cgagaagaa t ca taa tgggg aaggccatcc 1 02 0 0 

agcctcgcgt cg 10212

<210> 82
<211> 10426
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Recombinant
pAN267 plasmid

<400> 82

aacaaaa t tc cccagtcttc aca teggt 1 1 gaaaggagga ageggaagaa tgaag taaga 6 0 

ggga 1 1 1 1 tg actccgaagt aagtcctcaa aaaa tcaaa t aaggagtgtc aagaatgrr t 12 0 

gcaaaacgat tcaaaacctc 1 1 1 a c t g c c g 1 1 a 1 1 c g c t g g a 1 1 1 1 1 a 1 1 gctgtttcat 1 3 0 

ttggttctgg caggaccggc ggctgcgagt getgaaaegg cgaacaaatc gaatgagctt 240 

acagcaccgt cgatcaaaag cqqaacca tt ettcatgeat ggaa t tgg tc gt t caa t acg 2 0 0 

t r aaaacaca a ta tgaagga ta 1 1 ca tga t gcagga ta ta cageca 1 1 ca gaca tc tccg 2 6 0 

a 1 1 a a c c a a g :aaaggaagg gaatcaagga gataaaagca t g t c ga a c t g g t a c t g g c t g 4 2 0 

tatcagcega categtatea aattggcaac cat tact tag gtactgaaca agaatttaaa 4^0 

gaaa t g t g t g cagccqc t ga agaa t a t ggc a taaagg t ca 1 1 g 1 1 gaege ggt ca t caa t :■ 4 0 
cataccacca gtgattatgc cgcgatttcc aatgaggtta agagtattcc aaactggaca 600

catggaaaca cacaaattaa a a actggtc t gatcgaaata gtacataatg gatttcc rta 6 6 !"> 

cgcgaaatac gggcagacat ggcctgcccg gttattatta tttttgacac cagac :aact 7 2 !'■ 

ggtaatggta gcgaccggcg ctcaggatcg tctcggtacc aagagtttgt agaaacgcaa 7 -> ! 

aaaggccatc cgtcaggatg gccttctgct taatttgatg cctggcagtt tatggcgggc 84 !■■ 

gtcctgcccg ccaccctccg ggcegt tgc t tcgcaacgtt caaatccgct cccggcggat 

ttgtcctac t caggagagcg ::caccgaca aacaacagat aaaacgaaag gcccagcc 1 1 9 6 0 

tcgacrgagc ctttcgtttt atttgatgcc tggcagttcc ctactctcgc atggggagac 10- ; ~) 

cccacactac catcggcgct acggcgttnc acttctgagt tcggcatggg gtcaggtggg l! 1 --'! 1 

accaccgcgc tactgccgcc aggcaaattc tgttttatca gaccgcttct gcgttctgat 1 140 

traatctgta tcaggctgaa aatctcctct catccgccaa aacaggatcc atcacgaagc 120 0 

gtcgtatcga aaaaattaat tttgcgcaac ggagaccacc gcttccttct tcttgccttg 12^0 

t:tcacaaac ggcatoatct cacgaagctt tcttcccac: acttcgattt gatgttcgtt 1:^.0 

ctcgcctgca tcgatagcgt cgaaacgagg acggt" tact: cggttttcga cgaiccactc 13 -:0 

tctcgcgaat gtaccgtttt ggatatcttt taa tact tct ttcatagatt cttttacttt 1440' 

agcgtccaca acgcgagggc ctgatacgaa atctccccac tgtgctgtgt cagagattga 1 0 ■!■ 0 

atatctcatt cctgcaagtc cttcttcgta cataaggtct acgattaatt t cage tea tg 1 :■ k- 0 

aagacactcg aagtargcaa gttcaggctg ataacctget tcagttaagg tttoaaatcc 16 2 0' 

ggctttgaca agegegctta acccgccgca aagaactget tgctcaccga acaaatctgt 1 o 0 

ttctgtttct tctttaaatg tcgtttctaa tacgcccgct cttgcgccgc cgattccttt 1740 

ageataageg agggctttgt ctcctgcttc tccagtcaca tcttgataga ttgcgaacaa 1-fOO 

tgcaggtacg ccagetce tt gc tea tat gt tct tcttacc aagtgtcccg ggectttagg lr-60 

ggccactaag aatacatcta catccgccgg aggaacaatt tgatggaaat gcacgttaaa 19 2 0- 

tccatgageg aatactaatg attttcctgc tgtcaattca tctttgattt cagcttegta 19^0 

tac 1 1 1 1 tgc tgc tgc teat ccggaagcag aacca tga tg attteggett ggge tgecge 

ttettttact gaaaa tac 1 1 t a tg t ccgtc ttct tgggc t tgagt gaaag attttecttg 21! 

t c t aacaccg acga tcaegt e tac t eege t 1 1 c 1 1 taagg 1 1 cagggca t g tgcg tggee 216 

1 1 gcgaaccg tacccga:aa ccgctactgt 1 1 1 1 ccagce a a tacg 1 1 c t c 1 1 1 ga ta t c 2 2 2 

accgttataa tatactttta cca 1 1 1 caa t c t c t ccc 1 1 g ttatgtttta tacaataga: 22 S 

attgttttat t gga t gaege c 1 1 1 1 gc t gg 1 1 cccc t cgc aaaagccg 1 1 gtacctgtt< 

t cgc gat 1 1 c 1 1 1 a a t geca taaggtttta a t aac t caa t aagege 1 1 ca attttgttag 2 4 



2 2 4 2 
attcacctgt cacctgaaca acgatgctgt ctctgctgac arcaacgaca gaggctrtaa 2460

acggttctat gattccatta atctctgttc ttgtt gaagg tgcggagaca accttgatta 2^2!- 

aggccagctc cctctiggaca atcgat:gat ttgtgatgtc tgtgactttc agcacaicaa 2 b - !■ 

tctgtttgtt gagccgtttc gt:aac:gtt caacatcatt t:caccttca acatgaacga 204 : 

cgaagg:gat tctggaaacg ccggctgttt ctgtgtgtcc aac:gcaa:g ctttcaatgt 27'..'? 

tgtaatgcct ttttgtgaat agaccggtga tccggcttaa caccccggag cggttcacca 270 ? 

cagtcaatgt gataattctt ttcaaggttt cacccccacc a:::catgca gccctttccc 2 : ;2 ':■ 

cggagccacc atcgggaata ctccttcttc gctggcaacc cgcacgtcaa tgacaaoagg 22 - 2 

ttctcttgat gttaatgcct cttccagct: ttcctttgct tccgcctcrg atgaaaccct 2940 

gatgccttta atgccgtatg cttcggacaa tccgacgaag tcaggctgag aagcgaactt 3 0 

agattctgaa taacgttctt catagaaaa: ttcctgccac tgtctgacca ttcogagaca 3 0 0 0 

agcgttattt aaaaccacta cctcgaccgg aagatttaat tcgcgaataa catcgagttc 3110 

ttgaagcgtc atttggaatc cgccgtctcc gacaaccgcg acaacagtag catottc::c 31--0 

ggccagctgt gcgccgatcg ccgccggaag accgaatccc atcgttccaa gtccgcccga 3 2 40 

cgtgacccat ttatctgctt t:tggaacgg ataaaactgc go:gaccaca tttgatgccg 3 3 0 j 

gcccacatcc gttgcgacaa tggcctctcc t:t"gtaaa" tgatgaatat attcaatcaa 3 3 0 0 

tttctgaggt ttaaaacctt cctcctcact atccacatac cagagcggat actcttcctt 342 

ccactctgcg agctgttttt tccattcgct tgaatcgcct tgtttgccgt cttgtttgat 3 4 0 

cage tec tgc aggacaattt tgctgtctcc gactacagga atctg:gt:t tcatgatttt 3543' 

tccgatttca gctggatcaa tatcgaegtg ggctatctnt gcgtrtetgg caaagtgttit 3600 

caggtttcct gtgacacggt catcaaaacg ggcgccgaca c:gac:aa:a gatcacactc 3 060 

atgaagggee atattggctg tataagtacc g:gca::ccc gccaccccta ggaaaagegg 372 ?' 

atggtcagcc gggaagcetc cgagccccaa aagggtgtgt gecacaggga tt tgc tgc eg 3 7 - -:0 

ttcagcataa ttttttaatt crtctgacgc ttttccgtgc agtacgcccg cacccgooag 3'^4j 

gatoaceggt tttttegcac tgctcacggc ttecacaagc ttgeggatet gcaaataatt 39 ?0 

cggctctgtt gtcggctggt accccgggag attcatctca tgategtage tgaattctcc 3 9 6 

ttcaattg-t. gctacatctt teggaatate aaccaataca ggtccgggtc ttceagttgt 4022 

tgcaatatgg aacgettett taatgatgcg eggcagatet tccggctggc gaacctggta 4 0 ?C 

gctgtgtttt gttactggca tcgtaatccc taaaar.gt:ct gettcctgaa a tgc a tract 

cccgattaca gaggttgeta cctgccctgt aaagacgact aacggcaatg aatcaatcat 4 2 



ggcatcagca aggcctgtaa caaggtccgt cgctcccggc ctgacgtggc aatgacgaca 4260 

ccggtrtccg gagacccttg cgtatccctc cgctgcatga attgctcct: gctcgtgacg 43 2 0 

ggaaggatat gtaccaacct gaatgtatag cttatcgtaa atcggaagca cagccccgcc 43 --I0 

cggataaccg aagatcat:t ctactttctc "ttttttaat gattcaatca gcattaatcg 4440 

cccgcccatc gtctgtgtac attcggcaga cgctgaatcc acctgcacat tagtccccat 4 50 0 

tttatctcct cctctagagc gtcctgccgt tgttaagatt a:tataccac accttgtaga 4560 

caaag:caac aactttttgc aaaatttrtc aggaatttta gcagaggttg ttctggatgt 4610 

agaacaaaac atctttccgc tcttgtgctg ttaggatatc tttcttggaa gctaggtagg 4 6-10 

cctcgagtta tggcagttgg ttaaaaggaa acaaaaagac egttttcaca caaaacggtc 4740' 

tttttcgatt tctttttaca gtcacagcca cttttgcaaa aaccggacag cttcatgcct 4300 

cataactgct gtttcggtcg acgaagcggc cgccgtttaa acgaactcct gcagctggcg 4360 

aatggcgatt ttcgttcgtg aacacatgtt ataataacta caactaataa cgtaacgtga 4900 

ctggcaagag atatttttaa aacaatgaat aggtttacac tcactttagt tttatggaaa 4930 

cgaaagatca tatcatatat aatctagaat aaaattaact aaaataatta ttatctagat 5040' 

aaaaaattta gaagccaatg aaatctataa ataaactaaa tcaagtttat ttaattaaca 510 0 

accatggata taaaataggt actaatcaaa atagtgagga gga:acat:t gaatacatac 5160 

gaacaaatta ataaagtgaa aaaaatactt cggaaacatt naaaaaataa ccttattggt 521:0 

acttacatgt ttggatcagg agttgagagt ggactaaaac caaatagtga tcttgacttt 52-^0 

ttagtcgtcg tatctgaacc attgacagat caaagtaaag aaatacttat acaaaaaatt 53 40 

agacctattt caaaaaaaat aggagataaa agcaacttac gatatattga attaacaatt 5400 

attattcagc aagaaatggt accgcggaat catcctccca aacaagaant tatttatgga 5460 

gaatggttac aagagcttta tgaacaagga tacattcctc agaaggaatt aaattcagat 5520 

ttaaccataa tgctttacca agcaaaacga aaaaataaaa gaatatacgg aaattatgac 5530 

ttagaggaat tactacctga tattccattt tctgatgtga gaagagccat tatggattcg 5640 

tcagaggaat taatagataa ttatcaggat gatgaaacca actctatatt aactttatgc 5700 

cgtatgattt taactatgga cacgggtaaa atcataccaa aagatattgc gggaaatgca 5760 

gtggctgaat cttctccatt agaacatagg gagagaattt :gttagcagt tcgtagttat 5?i0 

ctrggagaga atattgaatg gactaatgaa aatgtaaaLL taactataaa ctiattitaaat: 5^0 

aacagattaa aaaaattata aaaaaattga aaaaatggtg gaaacact:t tttcaatttt 594!' 

tttgttttat tatttaatac ttgggaaata rrcartctaa ::ggtaatca gattttagaa 6 0 II 

aacaataaac cct.tgcar.ag ggggatcgat atccgtttag gctgggcggt gat a 
cgt r cag gca gtacgcctct tttctcttcc agacctgagg gaggcggaaa tggtgtgagg 612 

ttcccgggga aaagccaaat aggcgatcgc gggagtgrtt tatttgaaga tcajg-::atc 61-; 

actgcggtca atagatttca caatgtgacg gctggacagc ctgaggaact ctcgaacccg 624 

aatggaaaca accagatatt tatgaatcag cgcggctcac atggcgttgt gctggcaaa: 6 :'• J 

geaggtteat cctctgtctc tatcaatacg gcaacaaaat tgcccgatgg caggtatgac 6 ? 6 

aataaagctg gagcgggttc atttcaagcg aacgatggta aactgacagg cacga:caa: 64^ 

gccaggtctg tagctgtgct ttatccrgat gatattgcaa aagcgcctca tgtttcoctt 64- 

gagaattaca aaacaggtgt aacacattct ttcaatgatc aactgacgat caccttgcgc 6 64 

gcagatgcga atacaacaaa agccgtrtat caaatcaata atggaccaga cgacaggcgt 66 j 

tcaaggacgg agatcaattc acaatcggaa aaggagatcc aattnggcaa aacatacacc 666 

aticatgttiaa aaggaacgaa cagtgatggn gtaacgagga ccgagaaata cagtttngtc 672 

aaaagagatc cagcgtcggc caaaaccacc ggctatcaaa atccgaatca ctggagccag 67-: 

gcaaatgcct atatctataa acatgatggg agccgagtaa ttgaattgac cggatccrgg 6-;4 

cctggaaaac caatgactaa aaatgcagac ggaatttaca cgctgacgct gcccgcggac 690 

acggacacaa ccaacgcaaa agtgatlttr aa^aatggca gcgcccaagt gcccggccag 696 

aatcagcc'g gctttgatta cgtgctaaa" ggctcatata atgactcggg cttaagcggc 7 02 

zczczzcccc attgagggca aggctagacg ggacttaccg aaagaaacca :caatga:gg 7 0'-' 

tttctttttt gttcataaat cagacaaaac ttctctctcg caaaagtttg cgaagtgttg 7 14 

cacaacataa atgtgaaata cttcacaaac aaaaagacat caaagagaaa cataccctgc 72 0 

aaggatgctg atattgtctg catttgcgcc ggagcaaacc aaaaacctgg tgagacacgc 726 

cctgaattag cagaaaagaa cttgaagatt ctcaaaggca tcgtcagtga agccarggcg 7 <2 

agcggatttg acggcatttt cttagtcggg cggcaccccg ctaacggatt cac-:ac:cca 7 6-: 

agaa^tggag ccaatcaatt cttgcggaga actgtgaacg cgcaaaccaa cccttggcag 744 

aacacatcca tcgcgtccgc catctccagc agccgcacgc ggcgcatctc gggcagcgtt 7 60 

gggtcctggc cacgggtgcg catgatcgtg c:cc:gtcgt tgaggacccg gctaggcrgg 7 6 6 

cggggttgcc ttactggtta gcagaatgaa zcaccqazac gcgagcgaac gtgaagcgac 7 6 2 

rgctgctgca aaacgtctgc gacctgagca acaaca:gaa tiggtcttcgg ::tccg:g:: 76-- 

tcgtaaagtc tggaaacgcg gaagtcagcg ccctqcacca "tatgttccg gatctgcatc 774 

gcaggatgct gctggctacc ctgtggaaca cc:aca:ctg tattaacgaa gcgcrggcat 7-.' 

"gaccctgag :gatt::tc: ctggtcccgc cgcatccata ccgccagttg zzzacccZca. 7 S 6 
caacgttcca gtaaccgggc atgttcatca tcagtaaccc gtatcgtgag catrctctcc ^79^ 
cgtt:carcg gtatcatnao cccca:gaac agaaatcccc ct:acacgga ggcatcagtg 7 : J - 



accaaacagg aaaaaaccgc cc::aaca:g gcc-:gc::t:a tcagaagcca gacattaacg 8 



t 



cttctggaga aacccaacga gctggacgcg gatgaacagg cagacatctg tgaatcgc:: 8 1 

cacgaccacg ctgatgagct ::accgcagc tgccccgcgc gtttcggcga tgacggtgaa c < 1 

aacccctgac acatgcagct cc eg gaga eg gtcacagc:: gtctgtaagc ggatgccggg 

agcagacaag cccgncaggg cgcgtcagcg ggtgttggeg ggtgregggg cgcagcca:g : 



acocagccac gcagcgatag eggagrgtat ac:ggc::aa ctatgcggca "cagagcaga 



t^gtiactigag agtgcacca: atgcggtgtg aaatiaccgca cagargcgta aggagaaaat 



acogcaccag gcgc:cttcc gcttcctcgc tcactgac:c gctgcgctcg gtcgtccggc 



■7' '± 



tgeggegage gg rat eager cactcaaagg eggtaataeg g::atccaca gaatcagggg 7 72 

ataaegcagg aaagaacatg cgagcaaaag gecagcaaaa ggecaggaac cgtaaaaagg 7 7 -■ 

ccgcgtcgct ggegctttte cataggctcc gceeeeerga cgageateae aaaaatcgac 

geteaagtea gaggeggega aacccgacag gactataaag ataccaggcg :ttccccccg 8 7 0 

gaagccccct cgcgcgctcc cctgt:ccga ccctgccgc: taccggacac etgtcegeet -'7 7 



t 



tte gggaagegtg gcgctctc:c atagctcacg ctgtaggcat cccagtccgg 



tgraggtegt tcgctccaag etgggetgtg tgcacgaacc ccccgt:cag cccgaccgc: 7 7---.] 

gcgccttatc cggtaac'at cgccttgagc ccaacccgg: aagacacgac ctatcgccac 794 ; 

tggcagcagc cactggtaac aggattagca gagegaggca tgtaggeggt geracagagr 970 ; 

tctrgaagtg gtggcctaac tacggctaca ctagaaggac agrarrrggt atctgcgctc 9 7 6!: 

tgetgaagee agctacc::-: ggaaaaagag ttggtagctc ttgatcegge aaacaaacca 9 12; 

ccgctggcag cggtggtttc ttrgtttgca agcagcagat tacgegcaga aaaaaaggar 9 1 ~< 7 

ctcaagaaga ccctttgatc tctcccacgg ggcctgacgc teageggaac gaaaac:cac 92 4 " : 

gttaagggac tttggccatg agactatcaa aaaggatcct cacc:agat:c cttttaaat: 



aaaaatgaag tt:taaatca atctaaagta cata:gagta aacttggtct gacagtta 



aatgettaat cagtgaggca cccatctcag cgatctgtct at::cg:tca tccatag::g 9 4 



gacteee cg:cg:g:ag ataaetaega taegggaggg cttaeeatet ggccccag:g 9 



ctgcaatga: aeegegagae ccacgctcac cggctccaga LL:at.cagca a:aaa:cagc — - t 



cagceggaag ggccgagcgc agaageggtc etgeaaettt atccgcctcc a:ccagtc:a 
t:.aa:tgr:g cegggaage: agagtaagta gttcgccag: raat.agt.rrg egeaaegtrg 9 7 



ttgceattge rgraggcatc grggrgteae gc 



■ a t a t cr a i 



a t tcaac 
ccggttccca acgatcaagg cgagttacat gatcccccat gttgtgcaaa aaagcggtta 9730 

gctccttcgg tcctccgatc gttgtcagaa gtaagttggc cgcagtgtta tcac:catgg 9 340 

":a:ggcagc actgcataa: tctcttactg tcatgccatc cg:aagatgc t:tcc:gtga 9 9 CO 

ccgg:gagta cccaaccaag tca:tctgag aatagtg:a: gcggcgaccg agttgctctt 99^9 

goccggcgtc aacacggga: aataccgcgc cacatagcag aactttaaaa gtgctcatca 10C 2 0 

ctggaaaacg ttcttcgggg cgaaaaccct caagga'ct: accgctgt:g agatccagtt 1 0 1 0 r 

cgatgtaacc cactcgtgca cccaaccgat cttcagcatc c::cac:t:c accagcgt:t 10140 

ctgggtgagc aaaaacagga aggcaaaatg ccqca.a.3.^a.a. gggaaraagg gcgacacgga IjOjO 

aatgttgaat actcatactc ttcctttttc aatat:a:tg aagcatctat cagggtcatt 1.0060 

gccccatgag cggatacata tt'gaatgta tttagaaaaa caaacaaata ggggttccgc 1032 0 

gcacatttcc ccgaaaagtg ccacctgacg tctaagaaac cat tar care atgaca:taa 103:>j 

cctataaaaa taggegtate acgaggccct ttegtcttea agaatt 






<210> 33

<211> 4191

<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Recombinant
pAN263 plasmid

<400> 33

ccaatacgca aaccgcctct ccccgcgcgt tggccgattc attaatgcag cccggatccg 60

aaagaagagg atatcccggt tttacagaag gcattggatg atccaaaggt gtccatcaga 120

agacaggctg ttgtgtactt aggaatgatt gaaacacctg atgttcttcc tctattgtat 180

aaagcacttg aggacaaagc tgtatcagtc agaagaacgg ccggagactg cctgtctgat 240

atcggcgatc ctcaagccat tcctgctatg atcaagtcac taagcgactc cagcaagctt 300

gttcgctggc gcgccgccat gttcctgtac gaagtcggcg atgaaagtgc aattgaagct 360

ttgcgcgctg ccgaagatga ccccgaattt gaggtcagcc ttcaagtcaa aatggcgctt 420

gaacgtattg agcatggaga agaagcaaaa ggttctgttt ggaaacaaat gacggaaagc 480

agaaaaaaag gcgaataaag ataaaaaagg tgcagatcat gcaccttttt tatgtgaatt 540

ggtcgaccga aacagcagtt ataaggcatg aagctgtccg gttttt^caa aagtggctg 

gactgtaaaa agaaatcgaa aaagaccgtt ttgtgtgaaa aeggtctttt tgtttccttt 6 

:aa:caactg ccataactcg aggcctacct agcttccaag aaagatatcc taacagcaca 7 

aaa^caaaaa ciatatt. ttat tctacar. cca aa.ac a.a.cc Zc tactaaaatt cctcraaaaat 7 
tttgcaaaaa gttgttgact ttatctacaa ggtgtggtat aataatctta acaacagcag 840
gacgctctag aggaggagac accatggcag aactacgcag taatargatc acacaaggaa 900
tcgacagagc tccgcaccgc agtttgcttc gtgcagcagg ggtaaaagaa gaggatttcg 960
gcaagccgtt tattgcggtg tgtaattcat acattgatat cgttcccggt catgttcact 1020
tgcaggagtt tgggaaaatc gtaaaagaag caatcagaga agcagggggc gttccgtttg 1080
aatttaatac cattggggta gatgatggca tcgcaatggg gcatatcggt atgagatatt 1140
cgctgccaag ccgtgaaatt atcgcagact ctgtggaaac ggttgtatcc gcacactggt 1200
ttgacggaat ggtctgtatt ccgaactgcg acaaaatcac accgggaatg cttatggcgg 1260
caatgcgcat caacattccg acgatttttg tcagcggcgg accgatggcg gcaggaagaa 1320
caagttacgg gcgaaaaatc tccctttcct cagtattcga aggggtaggc gcctaccaag 1380
cagggaaaat caacgaaaac gagcttcaag aactagagca gttcggatgc ccaacgtgcg 1440
ggtcttgctc aggcatgttt acggcgaact caatgaactg tctgtcagaa gcacttggtc 1500
ttgctttgcc gggtaatgga accattctgg caacatctcc ggaacgcaaa gagtttgtga 1560
gaaaatcggc tgcgcaatta atggaaacga ttcgcaaaga tatcaaaccg cgtgatattg 1620
ttacagtaaa agcgattgat aacgcgtttg cactcgatat ggcgctcgga ggttctacaa 1680
ataccgttct tcataccctt gcccttgcaa acgaagccgg cgttgaatac tctttagaac 1740
gcattaacga agtcgctgag cgcgtgccgc acttggctaa gctggcgcct gcatcggatg 1800
tgtttattga agatcttcac gaagcgggcg gcgtttcagc ggctctgaat gagctttcga 1860
agaaagaagg agcgcttcat ttagatgcgc tgactgttac aggaaaaact cttggagaaa 1920
ccattgccgg acatgaagta aaggattatg acgtcattca cccgctggat caaccattca 1980
ctgaaaaggg aggccttgct gttttattcg gtaatctagc tccggacggc gctatcatta 2040
aaacaggcgg cgtacagaat gggattacaa gacacgaagg gccggctgtc gtattcgatt 2100
ctcaggacga ggcgcttgac ggcattatca accgaaaagt aaaagaaggc gacgttgtca 2160
tcatcagata cgaagggcca aaaggcggac ctggcatgcc ggaaatgctg gcgccaacat 2220
cccaaatcgt tggaatggga ctcgggccaa aagtggcatt gattacggac ggacgttttt 2280
ccggagcctc ccgtggcctc tcaatcggcc acgtatcacc tgaggccgct gagggcgggc 2340
cgcttgcctt tgttgaaaac ggagaccata ttatcgttga tattgaaaaa cgcatcttgg 2400
atgtacaagt gccagaagaa gagtgggaaa aacgaaaagc gaactggaaa ggttttgaac 2460
cgaaagtgaa aaccggctac ctggcacgtc atcctaaact tgtgacaagt: gccaacaccg 2520
gcggtattat gaaaatctag acccctggcg taatagcgaa gaggcccgca ccgaccgccc 2580
ttcccaacag ttgcgcagcc tgaatggcga atgagcttgc gccgtcccgt caagtcagcg 2640

taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac tcatcgagca 210'. 

tcaaatgaaa ctgcaattta ttcatatcag gattatcaac accatatttt rgaaaaagcc 272! 

gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca agatcctggt 23.1" 

atcggtctgc gattccgact cgtccaacat caatacaacc tat taatt tc ccctcgtcaa 23-1 

aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggc gagaatggca 294 1 

aaaggttatg catttctttc cagacttgtt caacaggcca gccattacgc ccgtcatcaa 3 *") 2 

aatcactcgc atcaaccaaa ccgttatcca ttcgtgattg cgcctgagcg agacgaaata 302! 

cgcgatcgct gttaaaagga caattacaaa caggaancga atgcaaccgg cgcaggaaca 312" 

ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat acctggaatg 31"---! 

ctgttttccc agggatcgca gtggtgagta accatgcatc ancaggagta cggataaaat 3241 

gcttgatggt cggaagaggc ataaatrccg tcagccagt: tagtctgacc atctcatctg 3 3 0! 

taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc gcatcgggct 3 321 

tcccatacaa tcaatagatt gtcgcacctg attgcccgac arcatcgcga gcccatttat 342l 

acccatataa atcagcatcc atgttggaat ttaatcgcgg ccccgacgag caagacgttt 34--! 

cccgttgaat atggctcata acaccccttg tattactignt tatgtaagca gacagtttta 3^4-! 

ttgttcatga tgatatattt ttatcttgtg caatgtaaca tcagagatrn tgagacactc 3 6 " ■■! 

gacaagatga tcttcttgag atcgttttgg tctgcgcgta atctcttgcc ctgaaaacga 3 62l 

aaaaaccgcc ttgcagggcg gtttttcgaa ggttctctga gctaccaact ctttgaaccg 372-! 

aggtaactgg cttggaggag cgcagtcacc aaaacttgtc ctttcagttt agccttaacc 3 7 3 2 

ggcgcatgac ttcaagacta ac:cc:ctaa a:caa::acc agtggctgcc gccagtggtg 3 842 

ctcttgcatg tctttccggg ttggactcaa gacgatagct accggataag gcgcagcggt 39 >■ 

cggactgaac ggggggttcg tgcatacagt ccagcttgga gcgaactgcc tacccggaac 3 9 *5 ■- 

tgagtgtcag gcgtggaatg agacaaacgc ggccataaca gcggaatgac accggtaaac 402i 

cgaaaggcag gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggtatcttt 403' 

atagtcctgt cgggtttcgc caccactgat t "gage g tea gatttcgtga tgcttgtcag 4140 

gggggcggag cctatggaaa aacggctttg ccgcggccct ctcacttccc t 4191 



<210> 34

<211> 702

<212> DNA

<213> Bacillus subtilis
<221> CDS

<222> (1)..(699)

<400> 34 

ttg tta ctg gtt atc gat gtg ggg aa: acc aat act gta ctt ggt gta 48
Met Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca agc agg 96
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tcc tta ttt gat 144
His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

cac tcc ggg ctt atg ttt gaa cag at .a gat ggc att att att tcg tca 192
His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

gta gtg ccg cca atc atg ttt gcg tta gaa aga atg tgc aca aaa tac 240
Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

ttt cat atc gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288
Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga atc gta 336
Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

aat gct gtc gct gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384
Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 432
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

tac atg ggc ggg gcg att gcc cct ggg att aca att tcg aca gag gcg 480
Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

ctt tac tcg cgt gca gca aag ctt cct cgt atc gaa atc acc cgg ccc 528
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

gac aat att atc gga aaa aac act gtt agc gcg atg caa tct gga att 576
Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

tta ttt ggc tat gtc ggc caa gtg gaa gga atc gtt aag cga atg aaa 624
Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

tgg cag gca aaa cag gac cca agg tca ttg cga cag gag gcc tgg cgc 672
Trp Gln Ala Lys Gln Asp Pro Arg Ser Leu Arg Gln Glu Ala Trp Arg
210 215 220

cac tea tta eaa aca aat eaa att ata taa
Arg Ser Leu Arg Thr Asn Gln Ile Val
225 230



<210> 35
<211> 233
<212> PRT

<213> Bacillus subtilis
<400> 35

Met Leu Leu Val Ile Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val
1 5 10 15

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg Ile Glu Thr Ser Arg
20 25 30

His Lys Thr Glu Asp Glu Phe Gly Met Ile Leu Arg Ser Leu Phe Asp
35 40 45

His Ser Gly Leu Met Phe Glu Gln Ile Asp Gly Ile Ile Ile Ser Ser
50 55 60

Val Val Pro Pro Ile Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr
65 70 75 80

Phe His Ile Glu Pro Gln Ile Val Gly Pro Gly Met Lys Thr Gly Leu
85 90 95

Asn Ile Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg Ile Val
100 105 110

Asn Ala Val Ala Ala Ile His Leu Tyr Gly Asn Pro Leu Ile Val Val
115 120 125

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr Ile Asp Glu Asn Lys Gln
130 135 140

Tyr Met Gly Gly Ala Ile Ala Pro Gly Ile Thr Ile Ser Thr Glu Ala
145 150 155 160

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg Ile Glu Ile Thr Arg Pro
165 170 175

Asp Asn Ile Ile Gly Lys Asn Thr Val Ser Ala Met Gln Ser Gly Ile
180 185 190

Leu Phe Gly Tyr Val Gly Gln Val Glu Gly Ile Val Lys Arg Met Lys
195 200 205

Trp Gln Ala Lys Gln Asp Pro Arg Ser Leu Arg Gln Glu Ala Trp Arg
210 215 220

Arg Ser Leu Arg Thr Asn Gln Ile Val
225 230
<221> CDS

<222> (1)..(1620)



- ^rrt nra ttc caq gtg caa aaa ttg atg egg tat ttg acg ctu 

Me't Tyr Leu Ala Phe Gin 5a? Gin Lys Leu Met Arg Tyr Leu Thr Leu 



1 5 1° 



cac aag ata aag gac ctg aaa tta teg ttg ccc ggc acg aac aaa acg 
Tyr Lys lie Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr An Lys Thr 

20 25 

cag caa ttc atg gec caa gca gtc ggc cgt tta act gga aaa ccg gga 
Gin Gin Phe Met Ala Gin Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 
35 40 45 

gtc gtg tta gtc aca tea gga ccg ggt gec tct aac ttg gca aca ggc 
Val 111 Leu Sal Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 



50 bb 



ctg ctg aca gcg aac act gaa gga gac cct gtc gtt gcg ctt get gga 
III III Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 



65 



70 75 



aac gtg ate cgt gca tat cgt tta aaa egg aca cat caa tct ttg gat 
Asn III HI Arg Ala Tyr Arg Leu Lys Arg Thr His Gin Ser Leu Asp 

85 90 

aat gcg gcg eta ttc eag ceg att aca aaa tac agt gta gaa gtt caa 
Asn Ala Ala Leu Phe Gin Pro He l'nr uy^ xyr Se, .a. C.u 

100 105 

% i" £ s: vz s" s - - - - i - »■ - 

115 120 125 

£ s: SS §2 e - K - - - £ "° - " 

135 140 



130 



„m aat aaa ate aca aat acg aaa aac gtg cgt get gtt gca gcg cca 
Va! Tsn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro 



145 



150 ^5 



4 3 



144 



2 0 0 - u D 



ctt cca ttt gtt gaa aca tat caa get gec ggt acc ctt tct aga gat 
* e » Phe Val Glu Thr Tyr Gin Ala Ala G.y A hr .eu ,er -so 



192 



2 i u " J 



ta gag gat caa tat ttt ggc cgt ate ggt ttg ttc cgc aac cag cct 



240 



288 



336 



384 



432 



480 



aaa etc ggt cct gca gca gat gat gca ate agt gcg gec ata gca aaa 528 
lys Leu G?y Pro Ala Ala Asp Asp Ala He Ser Ala Ala lie Ala Lys 

165 170 

ate caa aca gca aaa ett cct gtc gtt ttg gtc ggc atg aaa ggc gga 576 
ne Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 

180 185 I 90 

aga ceg gaa gca att aaa geg gtt cgc aag ctt ttg aaa aag gtt cag 624 
Ta Pro Glu Ala lie Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 



672 



„ ^ ^ 
Leu Glu Asp Gln Tyr Phe Gly Arg Ile Gly Leu Phe Arg Asn Gln Pro
225 230 235 240

ggc gat tta ctg cta gag cag gca gat gtt gtt ctg acg atc ggc tat 768
Gly Asp Leu Leu Leu Glu Gln Ala Asp Val Val Leu Thr Ile Gly Tyr
245 250 255

gac ccg att gaa tat gat ccg aaa ttc tgg aat atc aat gga gac cgg 816
Asp Pro Ile Glu Tyr Asp Pro Lys Phe Trp Asn Ile Asn Gly Asp Arg
260 265 270

aca att atc cat tta gac gag att atc gct gac att gat cat gct tac 864
Thr Ile Ile His Leu Asp Glu Ile Ile Ala Asp Ile Asp His Ala Tyr
275 280 285

cag cct gat ctt gaa ttg atc ggt gac att ccg tcc acg atc aat cat 912
Gln Pro Asp Leu Glu Leu Ile Gly Asp Ile Pro Ser Thr Ile Asn His
290 295 300

atc gaa cac gat gct gtg aaa gtg gaa ttt gca gag cgt gag cag aaa 960
Ile Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gln Lys
305 310 315 320

atc ctt tct gat tta aaa caa tat atg cat gaa ggt gag cag gtg cct 1008
Ile Leu Ser Asp Leu Lys Gln Tyr Met His Glu Gly Glu Gln Val Pro
325 330 335

gca gat tgg aaa tca gac aga gcg cac cct ctt gaa atc gtt aaa gag 1056
Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu Ile Val Lys Glu
340 345 350

ttg cgt aat gca gtc gat gat cat gtt aca gta act tgc gat atc ggt 1104
Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp Ile Gly
355 360 365

tcg cac tcc att tgg atg tca cgt tat ttc cgc agc tac gag ccg tta 1152
Ser His Ser Ile Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu
370 375 380

aca tta atg atc agt aac ggt atg caa aca ctc ggc gtt gcg ctt ccc 1200
Thr Leu Met Ile Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro
385 390 395 400

tgg gca atc ggc gct tca ttg gtg aaa ccg gga gaa aaa gtg gtt tct 1248
Trp Ala Ile Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser
405 410 415

gtc tct ggt gac ggc ggt ttc tta ttc tca gca atg gaa tta gag aca 1296
Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr
420 425 430

gca gtt cga cta aaa gca cca att gta cac att gta tgg aac gac agc 1344
Ala Val Arg Leu Lys Ala Pro Ile Val His Ile Val Trp Asn Asp Ser
435 440 445

aca tat gac atg gtg cat ttc cag caa ttg aaa aaa tat aac cgt aca 1392
Thr Tyr Asp Met Val His Phe Gln Gln Leu Lys Lys Tyr Asn Arg Thr
450 455 460

tct gcg gtc gat ttc gga aat atc gat atc gtg aaa tat gcg gaa agc 1440
Ser Ala Val Asp Phe Gly Asn Ile Asp Ile Val Lys Tyr Ala Glu Ser
465 470 475 480

ttc gga gca act gcg ttg cgc gta gaa tca cca gac cag ctg gca gat 1488
Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gln Leu Ala Asp
485 490 495

gtt ctg cgt caa ggc atg aac gct gaa ggt cct gtc atc atc gat gtc 1536
Val Leu Arg Gln Gly Met Asn Ala Glu Gly Pro Val Ile Ile Asp Val
500 505 510

ccg gtt gac tac agt gat aac att aat tta gca agc gac aag ctt ccg 1584
Pro Val Asp Tyr Ser Asp Asn Ile Asn Leu Ala Ser Asp Lys Leu Pro
515 520 525

aaa gaa ttc ggg gaa ctc atg aaa acg aaa gct ctc tag 1623
Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu
530 535 540



<210> 87
<211> 540
<212> PRT

<213> Bacillus subtilis
<400> 87

Met Tyr Leu Ala Phe Gln Val Gln Lys Leu Met Arg Tyr Leu Thr Leu
1 5 10 15

Tyr Lys Ile Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr
20 25 30

Gln Gln Phe Met Ala Gln Ala Val Gly Arg Leu Thr Gly Lys Pro Gly
35 40 45

Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly
50 55 60

Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly
65 70 75 80

Asn Val Ile Arg Ala Tyr Arg Leu Lys Arg Thr His Gln Ser Leu Asp
85 90 95

Asn Ala Ala Leu Phe Gln Pro Ile Thr Lys Tyr Ser Val Glu Val Gln
100 105 110

Asp Val Lys Asn Ile Pro Glu Ala Val Thr Asn Ala Phe Arg Ile Ala
115 120 125

Ser Ala Gly Gln Ala Gly Ala Ala Phe Val Ser Phe Pro Gln Asp Val
130 135 140

Val Asn Glu Val Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro
145 150 155 160

Lys Leu Gly Pro Ala Ala Asp Asp Ala Ile Ser Ala Ala Ile Ala Lys
165 170 175

Ile Gln Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly
180 185 190

Arg Pro Glu Ala Ile Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gln
195 200 205

Leu Pro Phe Val Glu Thr Tyr Gln Ala Ala Gly Thr Leu Ser Arg Asp
210 215 220

Leu Glu Asp Gln Tyr Phe Gly Arg Ile Gly Leu Phe Arg Asn Gln Pro
225 230 235 240

Gly Asp Leu Leu Leu Glu Gln Ala Asp Val Val Leu Thr Ile Gly Tyr
245 250 255

Asp Pro Ile Glu Tyr Asp Pro Lys Phe Trp Asn Ile Asn Gly Asp Arg
260 265 270

Thr Ile Ile His Leu Asp Glu Ile Ile Ala Asp Ile Asp His Ala Tyr
275 280 285

Gln Pro Asp Leu Glu Leu Ile Gly Asp Ile Pro Ser Thr Ile Asn His
290 295 300

Ile Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gln Lys
305 310 315 320

Ile Leu Ser Asp Leu Lys Gln Tyr Met His Glu Gly Glu Gln Val Pro
325 330 335

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu Ile Val Lys Glu
340 345 350

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp Ile Gly
355 360 365

Ser His Ser Ile Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu
370 375 380

Thr Leu Met Ile Ser Asn Gly Met Gln Thr Leu Gly Val Ala Leu Pro
385 390 395 400

Trp Ala Ile Gly Ala Ser Leu Val Lys Pro Gly Glu Lys Val Val Ser
405 410 415

Val Ser Gly Asp Gly Gly Phe Leu Phe Ser Ala Met Glu Leu Glu Thr
420 425 430

Ala Val Arg Leu Lys Ala Pro Ile Val His Ile Val Trp Asn Asp Ser
435 440 445

Thr Tyr Asp Met Val His Phe Gln Gln Leu Lys Lys Tyr Asn Arg Thr
450 455 460

Ser Ala Val Asp Phe Gly Asn Ile Asp Ile Val Lys Tyr Ala Glu Ser
465 470 475 480

Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gln Leu Ala Asp
485 490 495

Val Leu Arg Gln Gly Met Asn Ala Glu Gly Pro Val Ile Ile Asp Val
500 505 510

Pro Val Asp Tyr Ser Asp Asn Ile Asn Leu Ala Ser Asp Lys Leu Pro
515 520 525



c 



r-. - ci _ « 



_ w » ^ 

2 1 1 13 

<212> DNA

<213> Artificial Sequence

<223> Description of Artificial Sequence: ribosome binding site

<223> All occurrences of n indicate any nucleotide



^ u .j ■ n 



igaauggagg tgannnnnnn atg 



2 1 .1 



PRT 

Artificial Sequence 



9 O 



Description of Artificial Sequence: PanC 
C terminus 



4 ^ r\ . ^ a 



le Ar :f Glu Met Giu Arg lie 



210 



Artificial Sequence



Description of Artificial Sequence: PanC
C terminus



rg Glu Arg Arg 



i c :a. 
<— M, -l. 



:i;a 



<213> Artificial Sequence



■ — i ' — ^. 
^ Z j 



<223> Description of Artificial Sequence: Recombinant
pAN336 plasmid



410 - r ' 



■ _ O 



1 T 



6 6 0 



., ?cg ,. c <:::T.a cagggcgcgt ccattcgcca ttcaggctgc gcaactgttcj ggaagggcga ^ 
._ cgg . gcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcocag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgt^atacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 
ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 
aaaaaaatcg atgcgcgtga tattgacgaa gcgacggacc gtgtaattgc cggacecg 
aaga.ag,jg-:c gcgttatctc caagaaagaa cgcaatatcg tggcttat-ca cgaaggrgga 
racaccgtca tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480 
cctcggggcc aggctggcgg ttatgctgtt atgctgccaa gagaagaccg ttatttccaa 540 
acaaagccrjg agctgcttga taaaattgtc ggcctcttgg gcggacgt?t tgctgaagag 600 
attatcttcg gtgaagtcag cacaggggcg cacaatgact tccagcgtgc gacgaatatt 
gcaagacgaa rggttacaga attcggtatg t.cagaaaaac tgggaccgtt gcaatttgga 720 
cagtctcagg gcggtcaggt attcttaggc cgtgatttca acaacgaaca =jaactacag 
gatcaaatcg cttacgaaat tgatcaggaa attcagcgca tcatcaaaga atgttatgag 84 : 
cgtgcgaaac aaatcctgac tgaaaatcgt gacaagcttg aattgattgc ccaaacgctt 900 
ctgaaagttg aaacgcttga cgctgaacaa atcaaacacc ttatcgatca tggaacatta 
:ctgagcgta anttctcaga tgatgaaaag aacgatgatg tgaaagtaaa cattccgaca 1 
aaaacagaag aaaagaaaga cgatacgaaa gagt:aattcg ctttctttct aaaaaaactg 2 
ccggctgacg ctggcagttt ttttacgcaa atgattggct cagctgcggc tctcacaatc 
atccaattct rrgtaccgatt cgttcacaaa tgagccgctg accgcgtatg gtatrgtaga 
a:a :::^.aa aaagt.aaagt agagaaacca ttcaaaagtg gtgatagagg 
► . , ~ • : i * : ' aaaaaaf -ice r"aca;:ic:ca a t~ :a : a * 

. ..... ; ^ f r.-,.-;--a :a r c c :;a r. a :gg c a aga^ata 

_ - -- - — - — «- — ~i *- -~. -~- ■* ^ *~ *~ a 

-,, _ , — - T - - - - — -t ^ -r a ^"4 — — -•» .-"4 *- j - ♦ . J - J 1 . V_ - .1 .. _' ^.t ■■■■ -- ■-' - - • -» - - ~ ■ 



- - - ~> 

^ ^ ^ ^ 



c • : • • i \a i a T * cataaggaaa - * 






I M) - 



caatactgat aaga^ata: 
' ■- a a g g • • ; ♦ — 



^aga :a: 



..at: -:t t 



"T ■■' T .T . 



; : " * 1 * < at. a^.a: c t a 



^ctaatact t 



- gaaacccaqa acaataacct -at 



a -._ _ _ a z 



^taattgtgg rc:ca 



aaatc gg ctccgtcg ar.ac:ar.a:: 



actttgaaaa aactcr r> — .^^^ 



a g 1 1 c a 



g t t a t a a 1 1 a 



gc*:tcttggg gtatc 
gaaataataa atggctaaaa tgagaatatc ace 
ccgcngcgta aaagaracgq aaggaatgtc - 



atacgccaac tttcaaaaca 
aa agttttagaa tgcaaggaac agtgaa::^, 

da atactgtaga aaagagaaag 
gaattg aaaaaactga tcgaaaaata 
gctaag gtatataagc r.ggr.gqgaga 



a a a t ga a a a c r r a - a - - - ^ ^ ^ ^ 

^-^ ad aaatgacgga cagceggtat 



ggaacgggaa aaggacatga tae- 



a a a g gg a c c a c c t a t g a t g - 

gctatggct ggaagaaaaa ar .^ r .^ r . - 

" ^ - - ^ a - ^ { - c a ci ci g g t c c t 

acactttgaa eggcatgatg gctggagcaa 
t:tgctcggaa aaa 



t-ttgetcatg agtgaggccg ataaca- ; 
gtatgaag atgaacaaag ccc:;gaaaag 
gtgeatcagg ctctttcac- cca: cga 



aaactgggaa gaagacactc catrtaaaga tc 
ggaaaagece gaa^faggaa 
"gtgaaagat ggcaac 



attar: ■ • g age z g z a t g c g g a 
a * "aa atagct taga 
ggccgatg tgaattg-ga 
-gegegag ctgtatgatt ttttaaagac 
tgtcttttc ccac- 



cat ateggattgt ccct 

'ageegctta gecgaattgg a -,r-,^ ^ 

-gy d -cic,.a... gaataacaat rr 



— _. 



ggegae ctgggagaca gcidcdicrt 

l9taa ^Wrcttat t g .,tctr. ggg acaagegaca gggcgg acaa 
g-ggtatgac a:t«cc::c; gegtregg 

"gtcgagcra cttrnt 



^. 1 — 

- -± C 



'gtc gatcagggag gatatcgac 
gact tactggggat caagectaat tgggagc 
tattttactg gatgaatetg 
a aa::t- 



gggg aagaacaa: ( 



a 



a gacatcta 



gaa a a t a a a a t a 1 1 a 
ttagtacct agatt.taaat gt.ctaaaaag ctttaactac 
at cttttctga^ gtacatccae 



aa ctgtccat 



ac t c t a a tat 



ttatatct, t t - c ~- 



-ctaaaagt t 



'get: a gat. a aaazrteccaa acar-n*--^^-f^ - 

- - ---- v--' - 1 r ^ y v (. ^ a l cj a ct a a a ^ t " ~r ^ 



cgeca z z eg cca: t c, 



a a g g t c a 1 1 a 



c a a 



a a 



c a g g c t g c g c a a c t g 

a g g c a a . x a c a g g a c 

v * ' a . g ; ; T ; • ; * a 

'aaaq.iaa r." 



^ -gggaaaaa 



a a 



: * a • 



j j a a a a re* ; era a" * ; . • 

T ^ *" 1 _ T-T3 ■, ^ 

~ - " " ^ - rl a g - - a g t a a - a : . . - - a - - * , . T . 

ac : : • a j j : 



k. k 



ci . .a 



i a 
t a a c ccgcaag :r.ca::.::ga t t t a :i a r. rj a a c a a g g t a a g c ":ga r. g t c a g 



■j ; £ g 
~? ~j 



Cl M '._ 



ci a 



:raac a : : aa 



_ 7.1 :i d 



c a ■ 



caca g g a c a a g t-agaaa t c git:.?ag g a g a a t t a g g a 



CI _ — H CI 



c::ac:a:c 



cater. 



g a g c a g g 



teat rag t gggcgtaggt gtger egtaa at cctgaca a 3r4 



raccattctt gcggragggg get t tat tat tcagrtgatg ccgggaacag atgatgaaac 36 C 
aatcacaaaa a t r g aacag e gtctatrtca agtagagreg atttctaage teatccaaaa 3 66 
agggrtgaca ceagaagaaa ttttagaaga agtcetaggc gaga a ace tg agattttgga 3 7 2 



aacgatgect gtcagattce attgeeettg ttraaaagaa cggttegaaa eagecatttt 37? 



aggactaggc aaaaaagaaa ttcaagatat ga:agaagaa garggacaag eegaagcagt 3^4 



atgerac 



gtaatgaaa agtaettatt tacaaaagaa gagctggaag ggcttcgtga 3 9 



r\ ■">, ^ 



ccaaactac: egctaaget-: tttagegggt ttttaatttg agaaaagggg etgaaagcag 3 96 



'j 



gtttjaaate a agaacaat: tggacgcgtt ggatgeatag cttgagta 



a t a g t g t c t 



acedia a tag rttggrgtaa tcatggteat ag:tgtttcc tgtgtgaaat tgttatccgc 4 



teaeaattcr a-::acaacata rgagccggaa geitaaagtg taaageergg ggtgectaat 4143 
gagt jagcta a^tcacatta attgcgttge gctcartgec cgcttteeag trgggaaacc 4230 
tgtcjtgrca getgeattaa tgaategger aac g rgc ggg gagaggrggt ttgcgtattg ±111 



0* gcg- 



get r e rt eg e t ractgac t rge tgcg r tc ggr rgt t r g g e t gc ggegag 4 3 2 3 



eggtt it e age teacteaaag geggtaatae ggttarccac agaatcaggg gataaegcag 4 3 



gaaagaacat gtgageaaaa ggiragcaaa aggecaggaa ee 



gtaaaaag geegegttge 4 44 



ggcg- 



ga_aggetc eg zee eeetg acgagcatca caaaaarcga cgeteaagtc 4r3 



agaggr^gcg aaaeeegac:a ggactataaa gataccaggc gtttceeeet ggaagetcce 



z r o 



rcgtgcgctc teergtteeg accctgccgc ttaceggata cctgteegcc 



■zee 



4 ~~ 



gggaiagcgr. ggegctttcc catagcrcac getgraggta rcrcagttcg grgraggrea 46 
r e g e t e e a a g e 1 1 g g g e t g t g t g e a c g a a e e < r c c e g t t e a a c c e a a c cgc r g e a e e r r a t 4 ^ 



a a t aae r a 



:j r. c r r g a a t c e a a c e caa :aaaaca c a a 



a", cgc c a ctggcageaa 



cae r ac 



a a a e a a a a t a 



a • : : : a 



a ; : r 



* c : c r a e t a c a c r. .• ; a a a . : : : i 



— ;i -r 



a a * r a e 



: :aa a a a c; a a t r a a * a i c 



aa caaar'aadcr acccc 






a a 



aatc:aaag': a z a e a e g a g e aaact: g g e c e g a c a g * z d c c a a e g c z. z 



agcgagg: a c c e a e c e c a gcga e c e :i z 



i:cca:aq:: acre a acrcc r> 



cg:cg:g:a ga:aac:.acg ataeggga gg g c::acca: c z a a c c c c aa: a c z a c a a e a a 



taccgegaga cccacgctca c:ggc:c:ag atttancagc aacaaaccag ccagccggaa 5 4 



gggccgageg cagaagegge c:rgcaa:tt eatcegccec ca::ca?!:cr at:aat:g:t 546 

geegggaage ragagtaagt ag:tcgc:ag :taatag::: gcgcaacgrr gtrggca::g 552 

c:acag?ca: eg zgg :g tea cgetc?: :g: ttggtatggc ::car.:cagc :ccgg::ccc 5 5 S 

aacgaccaag grgagttaca tgatc;c-:ca egeegrgcaa aaaagcggti agc:cc:tcg 5 6 4 

gLcctccga*: cgt:gtcaga ag:aagv;gg ccgcagtgtt: a:cactca:g gt:ar.ggcag 57 0 

ractgeataa zzczczzacz grca:gc-:at ccgeaagatg c:::t;c:gtg act jgtgag: 5 7 6 ■ 



acecaaccaa g::at:ctga gaataccgcg cccggcgacc gag^gctc: tgcccggcg 



aa:acggga taatagegea egacaeagca gaaceetaaa ag:gc:ca:c attggaaaac 



g::cttcggg gegaaaaetc ecaaggae cz :accgctgt: gagat;cc age tcgatgtaac E 9 



acecgtgc acccaactga tceecageae czzzzaczzz caeca gc gee :ctgggrgag 



^ 



caaaaaeagg aaggcaaaat geegcaaaaa agggaataag ggegaeaegg aaa»:gttgaa 60 60 

eaeecaeace ceeccteeee eaa eat: eaee gaagcaeeea ecagggeeat egtcecaega 6100 

g rg :ga tacat. atetgaatge aeetagaaaa aeaaaeaaae aggggeeecg cgcacatttc 6 1 s 0 

jregaaaage gccacctgta egeggegega aaeaeegcac agaegegeaa ggagaaaaea 6 0 4 0 

ccgeatcagg egaaattgta aacgttaata teeegetaaa atecgegeta aataettgte 6300 



aaa e cag- 



aeeeeeeaae caaeaggceg aaaecggcaa aazc^czzaz aaaecaaaag 536 0 



aa::\gac:ga gaeagggetg agegtegeec cagtteggaa caagagtcca ceaeeaaaga 64 
acgeggacec caaegtcaaa gggcgaaaaa cegeceaee.a agcicaatciac ccaceaeaea 6 4 



a a e ■;- a t c a c c c a a a e c a a a e 



g c g g e c g a g g e g c c g r a a a g c e c e a aarcqaaac e 



aaagggag ccceegaeee agagceegac ggggaaagee : : . * : a a c . r t gcaaaaaaae 5 



aagggaagaa agegaaagaa gegggegcea gggeger aac aa 



a a 



■_ ci e e a e a c e c a ■ 



e .it . 



a a 
• - - • * " ' * g c c g r r r. c aa a a g c 0 : "a at. at.aaaaacct r. c t r c cac:a a • g : g : c a : :
a g t g a c a r :agaaaac c g ac:g:aaaaa g:a c a g z e g g c a ::a::: c a r. a t r at :. a a a 



cag:ca:i aggcctatc: gacaa::cc: gaa:agag:: ca:aaa:aa: cctaca: 



j a 



ca:ca:a aacagaatga tgtacc:g:a aagatagcgg taaa:a:att gaat:acc:t 0 



i::aa:gaa :::: :c:gc: gtaataargg gtagaagg:a a::ac:,Jt:a ::a::ga:ar. 30 0 



:raagr.:aaa ccca^naaa: gaagtrcatg gaa:aataga aagagaaaaa gca:::';cag 3 
grar.aggr.gt t::gggaaac aa:rt::ccg aac:a:tata : f ::c:c:aca :cagaaagg: 4 



a:aaa:ca:a a a a c z c mtg a a g t c a 



tracaggagt ccaaatacca gagda"g: 



:agacacac: arcaaaaat:: gcaraaagtg gc::taac:: a;::ccaataa c::aa::ctc 



gr^att g:aa ::ag:*: ::aaaagc:g :a:::gagt: :a:caccctt gt:ac:aaga 6C 



aaa:aaa:g: aggg:aaaat ::aOa:cc:t cttg::t:tat g f ::t;cgg:a: aaaa-:ac:aa 6 5 
:atcaat:r: :gtgg::ata ccaaaag:cg :ttgt:ggct caaa:aa:ga craaa:at:t 70 



aa';:g:cr. aaat-:aa::: :a::aaagt: :d:ro;gata: gc::cc::aaa 



a aagtgaa:r;t aggaggctta c Z tg Zc tgc z tt;:t:::atta gaat.:aa:cc 8 4 C 



*:aa.aa gtc.aa:a:ta ctgcaacata aatatatatL :t;aaaaata: accacct^a 



z y w ^ 



c c a a 



i_ i_. t. 



j :t:g::gaac :aar. gggtgc :::ag::gaa gaaraaagac caca::aaaa 9 



aatgtg g 



tg'.gttt: :t:aaagga: :t.gagcg:ag cgaaaaat cc :r :. 



a:ct:ga:a ataagggtaa ctattgaa:: :ggtacc,aag ag:t:t;graga aacgcaaaaa 10^- 



ggccarccg: cagga: ggcc :r. :tgct:aa :t:ga:gccr. gqcag::;t.a: '?gcgggcgtc 



L C a U(. 



ca ccctccoaac c 



:g::g:::cg :aacg::caa arccgcrxcc 



aa:: n g 



Za.cz c a g gag a gcgttc accga a a a a c a a c a g a:a a a a c g a a a g g c c a a g 0 c r. zzca 10 



a <r t g agcc:: :cg::::a:: :ga:gcc:gg a a g::ccc:a 



a c a t: a a a a a a a c c a a 



a a c a a a c ..a _ c a a ■ 



a eg gcgr. ::ca 



a a a r. 



a *. : : a • 



.ki : jnuci 



a a 



a a c a 



a 0 a a a a c 



a 



a: • a a a ctaaaa a 



, \ a a a 



:t a a a a 



..l ;■ i -:i ci 1 



a a ca a a 



a ; i ; ! i - : ; ; a 



r :a* aaaa 
a t: a a c a c c a c



:ccccg:r.r. accaggaata a:atacg::: caaga:'j:gc ticcattatid 



i a 



::ar:cacaa r::g:ac::r. ::cair.agga agca:tccca cagcatraa: gaga:.c::ca 1^60 



c a a 



giaa :.gc:r.:ccac -acagrtcagg c::gc:tccg naacag:tgc c:t?:gaagr. 192 



:gccgc:ca :ca:tg:r:g aracatatra :a::::c:::c atttcrcgaa tatraacaa: 



9 3 0 



ga:a::a:c: a::aaacg-:g c:::cgaaaa agcaaccgra acagcgagaa t:a:cttccc 2 



a^ca^zzzca r.:cacaggct cgagttccgg ataggaa:aa agctctacat agt-:ta:gg 



caa:ga :a 



zzz^'s ag..:::::a:: a :cg :::cag gar 



a::ggcrrgg acaag:tccg cac:tgr^:g aagggcccga :acagct:ag g =*gc 

z .? c r. c c t c a cgc r.g::aagt a:acat z. gc :g ag a g c ctt: g g : t a age c g t; c ::cc r t: r. 2 2'^ I 

ga:cg:acc3 acajgaacca ar:caat;atc catgaagaag :-:gc:gatta acccancaac 2 34) 

aa:agc:ac: tgc:gcgca: c:t:taaac: gaaataggca cgag:cggci tgactaga:: 2 4 0'") 

gaaaagct:: g:::ag*;acga tcgcgaccc: g:caaaatg: cc::c:cttg agzrgcccgca 2 4 6:"; 



aa:acg::: g:gcg*:cttt ctacatgaa: cgtgaca 



cacegg ga:aca:aic 2 :> 



a:gagca"::t ggcgtaaaaa gaatatcgac :ccggcgt t: t :ctgcaagag ct:gcat;:ccg 2 5 # 
■::-:aarac:g cgcggatatg ct^caaaatc ttcartaggg ccgaattg:g caggai;:ca: 264:. 



aaaaataci': aLaa: : iacgg rgtcgt 



ttgt;ctcgc: mgtictgcra aggt;taaat:g 



etcatge ag.iaa-'ccca tegteggaac aaat: ::cgat:: geictitg- 



c t:gaa t; ggr. a 2 7 6 C 



::g:r.t:a:g gcttc::ttra gctgtgaaa: atcagtaatc t:gtctca:ct :a:t..:t.cccc 2R2 0 
■:g:acaagcc gtcaagcact gtctgg::ca tttgaaagga atgcttttgt tcaggqaaag 2 8 ^ C 
racgatjee: :aca:cctga aca:a:ccgc tga:.r.gctg: ::tcgatggtt tcatcaa:gc 294C 1 



gcg:atar:g cctcaoaaat t:agg:g:" 



^ — ■ ^ — L- C 



aacacc gtggccgata a:.atcatga: 



aaacgagaac t:g:ccg:cc gc::tcacac cagccccga: ::ccaatgacc gg:a:gc:ta 3 06C 
gcgtct'rggc aa::: f ggc: gtgag::c:g ccggcacaca r:ccagcaca agca:ca:ag 3122 



czcczqczzc ::cgcat::.t: a:ac:gr 



c z a t. r. a a 



zqacqczz. :ci::ca:cr.r 3 1-3 



gece ta:a> 



a z a ' 7 c c g c c cagt a c 



ric cgac z. gac r g rag*;gtcaaa cctaag:gac 



j ~ ^ 



a . aa cia ac 



a ac;cg:caa:a racgaatg^a 



a a a c ac a 



: a a 



r. c a a 



a ca 



a a a a c a ^ a a 



a z. c a * 



a at. a : i : 
3. t t t t. t ci CT a .3 a ci t C C ci



".r a a c a t c c r ci c r a 



a a gacc.str.ara ccacacc::a t a a a t a a a rr t caa c aac:: 



7i c a a a a t t :::caaaaar. 



aacaaaa 



w ^- — < — ^ 



t ct g a r_ q r_ a a aaca a a a c a t c t * t 



gc:c::g: gc:g:ragga rate 



tctt ggaagcragg taggecrega grtarggcag 



::gg:raaaa ggaaacaaaa agaccgtttt caracaaaac ggrctttttc gat ttc trtr 3900 
t acagtcac.fi gc:acrr::g :aaaaa:cgg acagcttrat g:c:rataac tggctgtttcg 196 2 
gtcgacaagc ttegegaage ggccgcaaaa trcactggcc grcgrttr a-: aacgtcgtga 4 22 2 



gggaaaac rctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 



grgtaat agrgaaqagg cccgcaccga :-:gccct.r.cc caacagttgc gcagcccgaa 



1 i 

1 ^ ^ 



ggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg 4 2 0 n 
atatggtgt attetcagta caatctgctc tgatgecgea tagttaagee agccccgaca 428;: 



C'::gc:aa:a rrcgrt grace atgctrgtaa arcgttttgt gaaaaaa 



1 1 a a a a t a a a A < 



aa :ig gggao r tctaggggtcc ccaattaatt agtaatataa tctattaaag gtcattcaaa 43^: 



aggtcatcra rrggatcagc ttagtaaagc cct-:gctaga ttttaatgcg gatgt. tgega 



^± ^± \ 



t tac t tegee a ac tattgeg ataaca agaa aaagecage t: t teat gat a ta tc t c t caa 4 b 0 0 
tttgtgtagg gectattatg cacgc::aaa aataataaaa gcagactcga cctgatairjtt 45 6 0 
tggctgtgag caattatgtg ettagtgeat c:aacg::r„g agttaagccg cgecgcgaag 4 62C 



cggcgtcggc ttgaacgaat tgttagacat tatttgeega ctaccttggt gatctcgcct 46 



SO 



teaegtagt ggacaaattc ttccaactga tctgcgcgcg aggtcaageg atcttcttct 474 



grccaagat iagectgtct agctt:caagt atgaeggget ga taetgggc cggcaggcgc 



_ '._ 1. c 



rattgece agreggcage gacatcct. tc ggcgcgattt tgccggttac rgcgctgt 



aaargcggg acaaegtaag cactacattt cgctcatcgc cagcccagtc gggeggegag 4 90 
t ccatag c a *: r a a g g 1 1. 1 c a 1 1 1 a g c g c c tea a a t a a a :. c e t g 1 1 c a g a a a c e q a a tea 4 ? 



« i Is T 



:a e c t a l a a a cr c a a c a c t a r t t e t c 1 1 ct c 



cl 



aaciaragcca gatcaatgre garcg: gcicr ggc t caaaaa taecrgeaag a a * t * • ♦ : * 



a r. r 



aaa r t caa t 



t t a a 



: ■: : a *: aaca c c a c a a a a * : a 



J - v. d _ .1 -.-1 



r acaac :j en era aaa : 



i <. i 1 1 ■_ j 



idrl : . " 



■ i ; 1 I 
gatagttgag ccgatacttc ggcgatcacc gctrccctca ngatgtntaa ct:tngf



CI ^ 



a 



ggcgacrgc cctigctgcg: aaca:cgt:g c:gc:cca:a aca:caaaca :cgacccdcg to.: 



acg*:aacgcg cccqcrgct: ggatgcccga ggcanagac: g:accccaaa aaaacaqcca t 



taacaagcca tgaaaarcgc cactgcgccg ccaccaccgc tg:g::cggt caagg:rc:g 



gaccag::g: qtgagcgra: a:gcta:::g cat:acagc: targaaccga a:ag?c::ar. 
g:ccac:ggg ttcgtg-::rt -:a:-:c?:::c cacgg:g:gc gteatetgge aaccttgggc 



ageagegaag tcgaggcatt ::^::c:gg ctggcgaa-:g ag:g:aagr- 



:cq gte t 



acgi:a:cgtc aggeattggc ggc:::gc:g :tc::ctacg g:aagg:g-:^ gtgeacggat : 



par 



c:gc-::ctggc ttcaggagat e ggaagac-:: cggccgt-: ge gg:g :tt gee ggtggtgetg 



= ■9 4 



accccggatj aagtggtttg :at:ctcggt tt:c:ggaa:j g:gag:atcg tttgtt7ge:: f 0 0 C 

c .agc::ctg:. atggaaeggg catgcgga:-: agtg-agggtr. tgcaactgcg ggtcaagga: 0 0 6 0 

c:gga:tt-:g atcacggcac gattategtg cgggagggca a ggge tccaa ggarcgggcc 6 1.20 

::gatgttac ccgagajctt ggeaeecage ctgrgcgagc aggg jaattg ate :: ggt gg a t'lSo 

tgaeettttg aatgacctt: aa^agatta: attactaacr aatt jggga-: crtagaggtc fc-2 4 0 



■_- _ i_ 



ta tttraaaaa: tt:ttcacaa aacggtttac aagca:aacj ggt:utgctg (".300 



gcaaacg ggetgttctg gtgttgctag tttgttatea gaategcaga t-eggcttea h 3 6 



ggtttgccgg ctgaaagege tat t tc t: tec agaattgeca tgaf:::ttc cccacgggag 6 4 2 ;» 
gcgtcactgg ctcccgtgt: gteggcaget ttgattcgat aagcagcate gcctgt::ca 6480 
ggctgtctat gtgtgactgt tgagctgtaa caagttgtct cagg:gttca atttcatgtt 654 J 



c 



.agttgctt tgttttactg gtttcacctg :tctattagg tgt:acatgc tgt:ca::tg 6 6 0 



ttacattgtc gat:tgttca tggtgaacag c: t taaatgc a :ca aaaact egtaaaage: 6 6 6 0 
crgatgnacc :atc:ttttt acaccg::tt catctgtgca tatggacagt tttccctttg 672:; 
a:a:c:aacg gtgaacagtt g:tctac:t: tgt:tgtcag :::tga:g:: tcactgatag 6o0 



atacaagagc caraagaacc t c a g a 



cgtatttag c 7 a g t a t g 1 1 ctctagtgtg 



ct teat ta 



ttgcgtcag cratgagaac gaaccactga 



a t a c 1 1 a c 1 1 1 q c a t g 



a c t e a a a a a 1 1 1 1 a c c t c a a a a c t g g t g a g c t g a a 



aattaa aacategtat - * - 



ta 



tt acataagtag gaatctgarg taatggttg 



a cr t a 1 1 



"caecatrca tttttatcta gttgttctca ag::cgg^'a c gaga treat 



a a a a a a t r a a r - a t a t e a a t c 



: ? 7 7 a t e a a • 








c:a:a_:rg :c::g:gag: :::c::::g: g::ag::ct: zzaaza^cc^ c:ca:aad:c 



* •" j * a ::i a gr. a:":^::::.c aaaagac:.:.a aca:g::cca gar.: a 



:l ... ^ _ c 



t z a a a 



3 - 



aa::gga aaaga:aagg caata^ctc: :cact.aaaaa c:aa::ctaa t 



gc 



?agaac:t gg catag:cig: -::a::ggaaa atctcaaagc c:::aaccaa agga^:cc:g 7 : : g 
arrrc - acag ttctcgtcat :agctctctg gttgcttrag ctaatacacc ataag-rattt 7 6 6 
:cC'?:a-::ga :g:rca:cat gagcgtat nggttiataag rgaacgatac cgzzcizzcz 762 



c 



7gt ag ggt t_ 1 1 caat cgiggggt r_g agta 5 tgcca :a:agca:aa aatcagctrg 7 6 8; 



?ct:-:a':g-:: ccgttaagtc ataagcgacta atcg:tag:t :a:ttgc:rt gaaaa:aa:t 774; 

aat::agaca tacatctcaa r tggtcta gg tgatrtitaac ca:r.a:acca attga ga: gg 7 3 ,g ; 

gctar-'-aa: gataattact ag:cctt::c cr,tt gagttg : gggtaatctg taaa::ct gc 17 - 6 ( 

taga-r-rrrrg ctggaaaact: f;g:aaattc: gctagaccct ::gtaaattc cgc_a?acrn 791:-; 

1 r: gt g:g: z z t:tttgt::a t;a::caag:g gt:ataat:t atagaataaa ga aag aata a 7 9 ^ C 

aaaaagataa aaagaataga t;c>:cag-::cc gtgtacaact cactacttta g: cagi t: ::c g 8J 4 ' 



■:agr.a:t:a«:a aaagga tgcc gcaaacgc:g t r, tgctcctc tacaa aacag acc t raaaac 8 1 



1 



■:o:aaaggcr. taagtagcac ■rcrcgcaagc tcgggcaaa: cgctgaatat tccf:::;g:c h16C 

^ccgaccacc: aggcacctga g:-.;gcrg:c: :t:tcgtgac a*:rcagttcg c:gcgcr:cac 31l_C 

.jgcroiggca g:.gaatgggg g:aaa:ggca ctacaggcgc c-.:tta:gga ttca':gcaag %2'-'Z 

gaaactaccc ataatacaag aaaagcccg: cacgggc:tc ~ oagggcgt t ttiatggcggg ^ -i 

tctgctaigr. ggtgctatct gacttt:tgc tgrrcagcag f.cctgccct ctgat::;:cc S40C 

agtcgacca ctticggatta zcccgzqacci ggtca:tcag ac:ggctaat: gcacc«:ag:a .346';: 



aggcagcggr. atcatcaaca ggc ttacccg :ct:ac:gtc aac 



'j ..1 j 






atgaa:::cc c:gc:g:dar aatgggtaga agg:aa::a: ta:ta:ta:: gata: 



aaacccag :aaatgaagc c:a:.ggaat.a a:agaaagag aaaaagca: 



caaciiat a 



ggtgttttgg gaaacaattt ccccgaacca :tata::tc: cracarcaga aaggta:aaa 4 2 0 



ca:aaaac: ctttgaagtc a:tcttta-:a ggagc:caaa caccagagaa tgttttagat 43" 



aeaee 



atcaa aaattgrata aagiggcict aacttarccc aataaccraa crcrccgtcg 5 4:': 



c:a:tg:aac cagttctaaa agctgtattt gagct:atca eecttgtcae taagaaaata 600 
aatgcagggt aaaatttata trcttcttgt tttatgtttc ggta:aaaac actaatatca 660 
atttctgtgg ttatactaaa agtcgtttgt tggtccaaa: aatga::aaa tatctc:::r 7 2 0 



ctcitccaat tgtctaaatc aattittatta aagttcatt: gata:gcctc ctaaat:: 



atctaaagtg aatttaggag gcttacttgt ctgct:tctt cattagaatc aatccttttt 8 4 C- 

taaaagtcaa tattactgta acataaatat atattttaaa aatatcccac tttatc-:aac 9 C 

tt:cg:::gt tgaactaatg ggtgctttag :tgaagaata aaga::a:at taaaaaatj: 96j 

ggtcctttgt gtttttttaa acjgatttgag cgtagcgaaa aatccttcc tttcttat;:t: 1 C) 2 0 



gataataag ggtaactatt gaattcggta ccaaga^ttt gtagaaacgc aaaaaggcca 



tc-:g:-:agga :ggcc:tctg cr.taattL ga :gc :tggcag t:taiggcgg gcg:cc:gcc 114 
•:gccac-:c:c egggccgttg cctcgcaacg zz-z.~Lcia.zc eg c:c:cggcgg att:gtcc:a 12 0 



a < 



ggagag cgttcaccga eaaaeaacag a:aaaacgaa aggcccagtc tttcgacr.ga 12 6 



gcct::cg:t :ta:::gatg cctggcag:: ccctactetc gcarggggag accccacact: 1320 

accatcggcg etaeggegtt :cacttctga gttcggcatg gggtc-aggtg ggaccaccgc 13 30 

gc:ac:gccg ecaggcaaat :c:tgt:t:a: cagaecgett c:gcg:t::tg at::aatc:g 14 4 0 

:a:caggcrg aaaatettet ctcaccc^c: aaaacaggat. C'rracggaaa tggageggea 1500 



aaaccgtttt actc:caaaa tcttaaaaga aaacccccga taaagggggc :t::cttc^a 



eaaaa z tg:a eggge tgg z 



ccag ca::c g t t e a a::::g 



ga^catt c a g _l c _ 0 



aacagccac: :tcggc:ca: ggc:.:gccgc v 



gate a gacatca':::: "gtaggaaa: 1630 



a at a a :aac 



g e a c a a g g c g:gc g get g c ace g 1 1 1 a age a t a t g a e 



1 ^ 



geegetteec egtttaeeag gaataatata egttteaaga ea tac tee at t a 1 1 a 1 1 a r. 



aeaa 1 1 1 a t ac 



1 1 1 1 e a z t a gg a a g e a t t e e e a e a a e a t e a a t a a g at e t r. c a t e a a t 1 ^ 



..■i V. * * _. ^ . -4 
a:a



a a .. -n 



ca7.7cac aggctcgagt: Z cc gg a ccagg aataaagctc: :aca: ag' :t a:ggt.:/;cqc 1 . 



re a cj 7 



tic a a z cr a 7 a z c z zzz c z a ■ 



::a::ac 



a::: c 7 7 t c a c c c: ct _ 



c::ggacaag :::cgcac:: gt::gaaggg ::cga:aca g c ::agg:gc: t:::: 7 c ^ 
cctcag:: g: taag:a:aca :cgcgagagc ::::gg::aa gccgr. ::::c Lc:c:.ga c c g 2 



tatcgacagg aaccaar.:ca ata:c:a:ga agaagtcgcc ga::aaccca tcaacaacag 



_j ^ ^ 



::acctgc:g cg-:a:c:::t aaaccgaaat aggcacgag: cggcttgacn agactgaaaa 2 



■gnrag ra:ga:cg:g a:cc:g:caa aatg:::::c tcttgsgcgc cc}ca:aaca :: 4 6 3 



cgtctrgt gcg 



rc z c 



aca :-gaa:cg*:ga cattc: 



accggga:ac a:atcatgag 



■at:c:gg-:gt aajaagaa:a tcgactccgg eg z z' : *: :; 7 gc aagag^rtgea tcrcgcr.caa 2 



tat:;gcgcgg a:atgcttca aaatc:tca: ragggecgaa t tigtgcagga ttcacaaaaa 2 



actcataat aaeggegreg ttttcrtgtc ctgct:::gtc tgctaaggtt aaatgcccrc 



'-\ ^ < , 



catgragaaa :c::a 



. — 



ggaa:aaatc cgatrga:ct gece: ctgaa tggtcan tegtc t 2 



::at: g gc 



agecg: gaaaratcag ?.;aa:c:g:ct ca:c::at": tccccrgta: 2 3 2 



aagc:gt:aa g:acr.g:::g g:tca:t::ga aaggaaege 



g c-.cagg gaaagracga 2 * 3 ; 



3~ 



a: -re rc gaacat a :ccgct:gatt grcgtticga 1 g gt t c;a He aa:gcgcg:a 294 



at:gct.'_ta caaat7.7agg :g:t:tc:;ca acaccgtggc :gataa::atc atgataaarg 



agaacttgtc cg:ccgc:t: cacaccagcc :oga:tccaa rgacrggtat gctr.agcg:c 306 
r:gg-:aa::i tgg:::g:gag :tctg-:cggc aoac a 7 7 :ca gcacaagca: catiagctc:: 212 
g:::::tcgc a n t. r; nanac n g::t:ctatt aa:::ct:gg cgczzzqzzc gtc:t:gcc: 218 



g:a-::t:a: agccgcccag tacgccgact gaetgeggtg :caaacc:aa grcgacT-gacTc 2 2 



a rtggaargc ::ccaagc j: caatgegoga a:ggat:.caa acacgcc::c :ccgccc:ca :1 



agctctcagtg :gtcagc:-:c gczz Zee z ga aegaragecg c:gcatr.ttc cagegtate: 3 



< m 



a gaca ggrgataaga ca: a a a c g g c ara:c:g:ca c a a 7 a aagg r. a 1 rc c g g c g c a 3 4 2 



ccc 



a a c ggc zzz zgz at gat: agate a r. g: ccgcaa cr.gtcacacc gacagr. 



ci a a 3 4 ~ 



c a a ct a c c a a a a a c a a c 



ccaaataaa 



i c c :. i ,• *" a a a a *~ c 



a a c 



acicaa 



a a ■ 



-4 l _ V_ 



a a a z a a t c a * a a a c c; a t.ca 



: a c a a z 



a * • 



:? a c 



ttaaaaa a:c o a : * 



a a a a 1 " _t 



c 7 ct c t c;t 



a a a a 7. at. car. a rcac a c • ■ • * a * a c rata a a c: c v. - a a c a a c t 7 7 7 7 ct c a a 






a cajc c a c 



tgcaaaaa c c a ciaca a c: :ca: a cc::a * a a c t a 



~z) J ^- — - 



t ::c agac at g:aaac::ca caaaa:aac: 



a a c ■:: c a a 



a t :;a c 



:a:ga::c:: c:cgc::ccg g:ggca::gg gatgcc c g c g :: g c a g gc c a t g c :g:c c a g 4 J - 
g:agg:aga: gacgaccatc agggacagc: t:aagga:cg c„cgcggc:c t t accagcc t. 414 



a ac 



gate a::ggaccgc :.ga:cgt:ac ggcgatttat gccgcc:cgg cgagcacar.g 41 



gaacgggrcg g:atgga::g tagg 



g:cgc cctatacctt g:c:g:::cc ccg:g::gcg 



gcggtg-ra tggagc rggg :ca 



a ac 



gaacggaa gccggcggca cc:;gc: 



a a c 4 31 



gga:::ac-:a etecaagaat tggagccaai -raatrcttgc ggagaartgt gaa:gcgcaa 43 SI 



ac raaccc ct ggcagaaca: a::cat:gcg :ccg:ca:cr ccagcagccg ^'dcgcggcyc 



^ 'i 



ate tc gggca gcgttg ggtc ctgg rcacgg gtgegcatga ^cg^g:::c: gtcg:tgagg 4 5 



ac re g gc t ag gc t ggcgggg t t g. 



?gtt agcaga argaatcacc gaiargcgag 



1 -J w 



cgaacgtgaa gcgact j:tg -:tgcaaaacg :ctgcgacct gag:aacaac aigaatggtc 461 



■j 



■ggtttcr gtgntt-rgta aagt:t:ggaa aegeggaagt: :^g:g-:cctg race at tat g 46^0 
eggatr: grategcagg atgctgrtgg ctaccctgtg iaa-"acct ae atctgtatta 4740 



argaageget ggcattgacr ctgagtgatt tttctctggt crcgccgcat ccataccgcc 48 



agttgtttac cctracaacg ctrcagtaac cgggcatgtt catratcagt aacccgtatc 4^ 



gt gage a tec tctctcgttt ca teggtate attaccccca tgaacagaaa ttcer rc tta 4 9 1 0 

caeggaggea tcaagtgacc aaacaggaaa aaaccgccct: taacatggcc cgc 1 1 1: a t ca 4 9^0 

gaagecagae attaacgett ctggagaaac tcaacgagct ggacgeggat gaacaggcag 5 0 4 0 

acatccjtga -it cgc t tcac gaccacgctg atgagctrea ccgcagctgc rtcgcgcgtt 5100 

tcggtgatga :gg:gaaaac ctctgacaca t.gc age tec c ggagaeggre acagcttgtc 5^60 



gtaagrgga tgeegggage agacaagccc gtc agggege gtcagcgggt gttggcgggt 



g t c g g g g c g c a g c e a t c r a c c c a g t c a e g t a g c g a t < a g c - g g a g " g t a t a c r g g err. a a c r. a 
tgeggcatea gage agar, tg cactgagagt gca':e;earg c ag ca t gaaa taecgcaeag 



a:a;;a:aaag agaaaa' acc acatcaacica 



a c"Cc\c 



a t caae teae 



4 j ■ T T T T 

*. C 1 ^ s. ^ v.. . j ■.. i 



lata c a a 



aeaa 



ra 



: c a a a a a a e a *: a * : a a 



4 ' i : 



: • : aaa a e 



1 i T / 
t

cggatac c tg :c:^ccr.t:c tcccttcg g g 
r a ci g :a:c:c ag:: c g g r g r agg:c g r r. c 
cg::cagccc ga::gc:gcg cc::a:ccgg 
a c a c g acr:a t c gc r a cngg c agcagcca c 
aggcggrgcr aca:ag::c: tgaag:gg:g 
a:::gg:ar.c tigcgctctgc tgaagccag: 
arccggcaaa caaaccaccg crggtagegg 
gcgcagaaaa aaagga:c:c aagaagatcc 
gcggaacgaa aacteacgr. r aagggatr.:: 
c:agarcct: ttaaa:taaa aatgaagttt 
ttggrctgac agttacraat gctraatcag 
:cg::ca:cc atagttgcct gactcccegt 
accatctggc cccag:gctg caatgatacc 
a t c a g c a a t a a a c: c a gccag ccggaagggc 
cgrctccatc cagt;ctatta attgc^gc^rg 
tagrttgcge aacgtitgtiLg ccd:Lg>:tgc 
t at: ggctt :a :tcagct::cg gtteeraacg 
g:g:aa-aaaa g:gg:tagct :cttcggt:cc 
aglgt:atca crcatggtta :ggcagca<:t 
aagat:gc:tt tc:gtga::g gtgag:a:*;c 
gcgaccgag: tgc^cttgc: cggcgreaar 
r.":aaaag:g c:catca:tg gaaaacg:"c 
gc:g::.gaga :ccagt!:cga "gtaac c c a r 
r a c r. ^ r c a c c agcgttitctg g g r g a g c a a a 
a a r. a a g g g eg acacggaaal g r r g a a t a c t 




aagcgtggc g c::t c r c a r a g c r c a c g c t ct 5 7 6 0 
ctccaagctg ggcigr.gtr.gr acgaac : '''r r : - 0 2 
r a a c r a r c g r c::gagtcca a r c c g g r a a g 5 - - 0 
tggtaacagg atrageagag rgaggrargr 5 94 0 
gceraacrac ggcraracra gaaggacagr. 6 0 0 0 
racer regga a a aa gage, rg gragcrcrrg 6 06 0 
tggrrrrrrr grrrgcaagc agcagar. r.ac 6 120 
rrrgarcrrr rcracgrgggt ctgargctca 61-^3 
g g r c a r g a g a t r a r c a a a a a g g a r r r r c a c 6 0 4 0 
taaatcaatc raaagrarar atgagraaac 63 00 
tgaggcacct atetcagega rcrgrcrarr 6360 
cgrgragara acracgarac gggagggcr. r 6400 
g::gagac:ca cgctcaccgg crc:cagarrr 64 30 
egagegcaga agtggtcctg caaetttarc 6 54 0 
ggaagcraga graagragtr cgccagrraa 66 00 
aggca:cgtg gtgtcacgcc cgcrgttcgg 6 6 60 
a::aaggcga rgtracargat eccccatgrr. 6 7 CO 
rrcgarcgtt grcagaagra agrrggcege 6 7 ^ 0 
gcataartct ctractgrca tgccatccgr 6340 
aaecaagtca rrcrgagaat agtgnatgcg 69 00 
a c gg g a r a a r -a c cgcgccac a r a g c a g a a c 6 9 6 0 
rreggggega aaactc:caa gga net race 7 00 0 
rcgrgeaece aaergarerr cagcarerrr. ~ : - : 
a a c a g g a a g g e a a a a r g e e a c a a a a a ci a a a ~ ^ 4 0 
catacctr: e r r r r r e a a r a r r a r r. a a . i a ^ 2 .0 0 



